PROBABILITY AND ITS APPLICATIONS 
TO ANTISUBMARINE WARFARE 



Billy Joe Smith



NAVAL POSTGRADUATE SCHOOL




Monterey, California 





GROUP PROJECT 

PROBABILITY AND ITS APPLICATIONS 
TO ANTISUBMARINE WARFARE 

by 

Billy Joe Smith 
and 

Larry Wayne Vice 
March 1975

Thesis Advisor: Carroll O. Wilde

Approved for public release; distribution unlimited.



SECURITY CLASSIFICATION OF THIS PAGE (When Data Entered)

REPORT DOCUMENTATION PAGE

READ INSTRUCTIONS BEFORE COMPLETING FORM

1. REPORT NUMBER

2. GOVT ACCESSION NO.

3. RECIPIENT'S CATALOG NUMBER

4. TITLE (and Subtitle)

Probability and Its Applications to
Antisubmarine Warfare

5. TYPE OF REPORT & PERIOD COVERED

Master's Thesis
March 1975

6. PERFORMING ORG. REPORT NUMBER

7. AUTHOR(s)

Billy Joe Smith and
Larry Wayne Vice

8. CONTRACT OR GRANT NUMBER(s)

9. PERFORMING ORGANIZATION NAME AND ADDRESS

Naval Postgraduate School
Monterey, California 93940

10. PROGRAM ELEMENT, PROJECT, TASK AREA & WORK UNIT NUMBERS

11. CONTROLLING OFFICE NAME AND ADDRESS

Naval Postgraduate School
Monterey, California 93940

12. REPORT DATE

March 1975

13. NUMBER OF PAGES

98

14. MONITORING AGENCY NAME & ADDRESS (if different from Controlling Office)

Naval Postgraduate School
Monterey, California 93940

15. SECURITY CLASS. (of this report)

Unclassified

15a. DECLASSIFICATION/DOWNGRADING SCHEDULE

16. DISTRIBUTION STATEMENT (of this Report)

Approved for public release; distribution unlimited.

17. DISTRIBUTION STATEMENT (of the abstract entered in Block 20, if different from Report)

18. SUPPLEMENTARY NOTES

19. KEY WORDS (Continue on reverse side if necessary and identify by block number)

Probability ASW Applications

Antisubmarine Warfare

ASW

Operational Probability Applications

20. ABSTRACT (Continue on reverse side if necessary and identify by block number)

This thesis presents a basic course in the applications of probability
to ASW. The course consists of six lesson plans and a "Study Guide."
The lesson plans are designed to give the instructor guidance on what to
teach, the depth required, and the objectives the student should be able
to accomplish. The "Study Guide" is provided for the use of both the
instructor and the student, and it should serve as a basic text for the
course. Current ASW tactical publications were examined by the authors
while developing the course, and as many of the probability applications
and as much of the associated probability terminology from these sources
as practicable (when this could be accomplished at the "unclassified"
level) are incorporated in the course.



Probability and Its Applications to Antisubmarine Warfare 



by 



Billy Joe Smith 

Lieutenant, United States Navy 
B. S., University of Oklahoma, 1966 

and 



Larry Wayne Vice 
Lieutenant, United States Navy 
B. S., University of Oklahoma, 1968 



Submitted in partial fulfillment of the 
requirements for the degree of 



MASTER OF SCIENCE IN SYSTEMS TECHNOLOGY 



from the 

NAVAL POSTGRADUATE SCHOOL 
March 1975 






ABSTRACT 



This thesis presents a basic course in the applications of probability
to ASW. The course consists of six lesson plans and a "Study Guide."
The lesson plans are designed to give the instructor guidance on what to
teach, the depth required, and the objectives the student should be able
to accomplish. The "Study Guide" is provided for the use of both the
instructor and the student, and it should serve as a basic text for the
course. Current ASW tactical publications were examined by the authors
while developing the course, and as many of the probability applications
and as much of the associated probability terminology from these sources
as practicable (when this could be accomplished at the "unclassified"
level) are incorporated in the course.

PROBABILITY AND ITS APPLICATIONS TO ANTISUBMARINE WARFARE



Introduction 

One of the most important characteristics of modern ASW (Antisubmarine 
Warfare) is the existence of so many uncertainties and unknowns. Examples 
of these "unknowns" abound in almost every aspect of this warfare area.

To a Tactical Coordinator on a search aircraft the "unknown" may be the
evasive maneuvers an enemy submarine skipper will employ, how many of the 
sonobuoys he has aboard his aircraft are defective, or which portion of 
a search region offers the best prospects for a contact. To a CIC officer 
the presence of a submarine within detection range may be unknown because 
of excessive ambient noise; at the time a contact is gained it may be 
unknown to him whether the contact is in fact a submarine, and then the 
target location accuracy may be unknown because of uncertainties in 
underwater sound propagation. 

Because of the existence of so many unknowns, many Naval Officers who
work in ASW are tasked with making action recommendations and action
decisions, which are based on predictions of the unknown in the light of
all that is known. Therefore, success in the overall ASW effort is
directly related to the capability within the ASW-related officer corps
in two key areas: (i) making predictions concerning the unknown, and
(ii) understanding the concomitant degree of accuracy and degree of
uncertainty of such predictions, so that the best possible decisions may
then be made.

Probability theory provides a logical and scientific basis for prediction
and decision making. It is not surprising, therefore, that
probabilistic concepts and the language of probability are in widespread
everyday usage in the ASW community today. Even the basic references
used in ASW, such as ComTac publications, ASW handbooks, Tactical
Reference manuals, TacAids, ASRAPS and AESPAWS, use the language and
concepts of probability theory freely.

Because of the widespread use of probability in the tasks assigned 
to the ASW-related Officer, his effectiveness should be enhanced by an 
increased understanding of probability theory and a better appreciation 
of its benefits in ASW. This report constitutes a "tutorial paper" 
designed to provide a means by which ASW-related officers may better 
understand the application of probability in modern ASW. The objective of
this project has been to distill from the general field of probability
those elements that are vital in ASW applications, and to describe these
elements, along with typical examples of their application, in an 
organized and readable presentation that is accessible to any ASW-related 
Officer, with the only prerequisites being basic algebra and geometry 
as applied to tactical situations in his working environment. 

This paper has been organized into the format of an academic course, 
including lesson plans, study guides and suggested textbooks. This 
organization was adopted because it is expected that this paper will 
serve as the basis for a future short course that will be available to 
the Fleet through the Continuing Education Office at the Naval Postgraduate
School. Because of the possible misunderstanding that may result from
the "course format" adopted here, it is emphasized that the content of
this paper does not constitute a Naval Postgraduate School course. It is
intended that an official course be developed from these notes; this,
however, involves an evolutionary process, including testing of the
materials with actual students, and subsequent revisions.

The Lesson Plans provide the objectives of each unit; they indicate
what is to be learned, the depth required, and the behavioral goals. The
Study Guides serve as the basic textbook. In addition, Schaum's Outline
of Theory and Problems of Probability by S. Lipschutz (1965) and Schaum's
Outline of Theory and Problems of Statistics by M. R. Spiegel (1961),
both published by the McGraw-Hill Book Company, are very helpful; these
texts are inexpensive and easy to follow, as they contain many sample
problems with detailed solutions. Reference is made to the appropriate
sections in the Schaum's texts at the end of each chapter.

The primary material presented is the probability theory required
for successful ASW application, and examples to illustrate the application.
The probability theory is preceded by a presentation of sufficient
statistical procedures to serve as "first principles" on which the
probability is based.

Lesson Plan No. S-V-1

Instructor:

Time: 65 min.

Date:

Subject Area: Probability and Its Applications to Antisubmarine Warfare

Topic: Data Reduction, Frequency Distribution

Reference: ASW Study Guide; Schaum's Outline Series, Theory and Problems of Statistics



Title: "Sample Data Reduction and Frequency Distributions" 

Materials: Transparencies of Table 1, Table 2, Table 3, Figure 1, 

Figure 2, and Figure 3 

Objectives: (1) Acquaint the student with the concepts of statistical 

population and samples from the population; the 
student should be able to explain these concepts by 
means of simple examples. 

(2) Discuss the basic terms required to describe frequency 
distributions, and show mathematically how they are 
utilized. The student should be able to determine the 
frequency distribution from a given set of raw data. 

Introduction: Instructor should inform students of purpose of lesson 

and briefly highlight important points to be covered. 

Presentation: Discuss the following points: 

1. Population 

2. Sample

3. Methods of describing a sample and estimating 
characteristics of the population 

4. Raw data 

5. Arrays 

6. Classes or categories 

7. Class frequency and frequency table 

8. Class intervals and class limits 

9. Class boundaries 

10. Size or width of a class interval

11. Class mark

12. General rules for forming frequency distributions

a. The range

b. Number of class intervals

c. Number of observations falling into each class
interval

13. Histograms and frequency polygons

14. Relative frequency distributions 

15. Cumulative frequency distributions or ogives 

16. Relative cumulative frequency distributions 

17. Types of frequency curves 

a. Symmetrical or bell-shaped 

b. Skewed to the right 

c. Skewed to the left 

d. U-shaped 

e. Bimodal 

f. Multimodal 

Summary: Review material covered, reemphasizing key points. Ask 

questions to determine effectiveness of instruction.

Lesson Plan No. S-V-2

Instructor:

Time: 115 min.

Date:

Subject Area: Probability and Its Applications to Antisubmarine Warfare

Topic: Measures of Central Tendency and Dispersion

Reference: ASW Study Guide; Schaum's Outline Series, Theory and Problems of Statistics



Title: "Special Measures of Sample Data--Central Tendency and Dispersion" 

Materials: Transparency of Figures 4 through 6 

Objectives: (1) Define central tendency and its measurement from sample 

data. Illustrate and develop the sample mean, median, 
and mode. The student shall be able to calculate the 
mean, median, and mode from given sample data. 

(2) Define dispersion and its measurement from sample data.
Illustrate and develop the listed measures of dispersion
so that the student is capable of calculating
the range, standard deviation and variance from given
sample data.

Introduction: Instructor should inform students of purpose of lesson 

and briefly highlight important points to be covered. 

Presentation: Discuss the following points: 

1. Index or subscript notation 

2. Summation notation 

3. Measures of central tendency 

a. Mean 

(1) (Arithmetic) Mean 

(2) Weighted (arithmetic) mean 

b. Median 

c. Mode 

4. The concept of dispersion of data 

5. Specific measures of dispersion 

a. Range 

b. Standard Deviation

(1) Formulation of

(2) Short method of computing

c. Variance 

d. Sigma-values for Normal probability distributions 
with corresponding probability values. 

Summary: Review material covered, reemphasizing key points. Ask 

questions to determine effectiveness of instruction.

Lesson Plan No. S-V-3

Instructor:

Time: 20 min.

Date:

Subject Area: Probability and Its Applications to Antisubmarine Warfare

Topic: Combinatorial Analysis

Reference: ASW Study Guide; Schaum's Outline Series, Theory and Problems of Statistics

Title: "Combinatorial Analysis"

Objective: Develop the concept of COMBINATORIAL ANALYSIS by defining and 

illustrating, with a simple example, the basic elements of 
this analysis. The student should be able to perform 
combinatorial analysis on a given example by utilizing the 
fundamental counting principle, and calculating permutations/ 
combinations to find the number of possible outcomes for 
the example. 

Introduction: Instructor should inform students of purpose of lesson 

and briefly highlight important points to be covered. 

Presentation: Discuss the following points: 

1. The fundamental counting principle 

2. Factorial n 

3. Combinations 

4. Permutations 

Summary: Review material covered, reemphasizing key points. Ask 

questions to determine effectiveness of instruction.

Lesson Plan No. S-V-4

Instructor:

Time: 150 min.

Date:

Subject Area: Probability and Its Applications to Antisubmarine Warfare

Topic: Elementary Probability Theory

Reference: ASW Study Guide; Schaum's Outline Series, Theory and Problems of Statistics; and Schaum's Outline Series, Theory and Problems of Probability



Title: "Elementary Probability Theory" 

Materials: Transparencies of Figures 5 through 12 



Objectives: (1) Develop student intuition in the basic concepts of
probability, i.e., his insight into what probability is.
Develop the concepts of basic probability through the
use of the proportional areas notion of probability.
The student should be able to compute simple probabilities
given sample data and a simple statement of the necessary
conditions.



(2) Develop the concept of CONDITIONAL PROBABILITY as an 
improved probability resulting from having previously 
eliminated part of the overall sample space to be 
"drawn on," through use of prior information. The 
student should be able to compute conditional 
probabilities utilizing any of the three listed methods. 

(3) Develop the concept of STATISTICAL INDEPENDENCE, and 
delineate between this and MUTUALLY EXCLUSIVE EVENTS. 
Also, show the utility of statistical independence of 
events in computing a simple joint probability. Given 
two events, the student should be able to state 
whether or not they are statistically independent. 

Introduction: The instructor should inform students of purpose of lesson 

and briefly highlight important points to be covered. 



Presentation: Discuss the following points: 

1. Elementary Probability Theory 



a. Background and concepts of probability 



(1) "Odds" or ratio concept 

(2) Proportional areas concept

b. Sample Space

c. Probability Experiment 

d. Outcomes 

e. Events 

f. Probability Function 

g. Probability Complement 

2. Conditional Probability 

a. Concept of 

b. Methods of computing 

(1) By definition 

(2) "Stating and Solving"--"Simply writing 
it down" 

(3) Bayes' Law

3. Statistical Independence 

a. Independent Events; Probability of Independent 
events 

b. Independent Events vs. Mutually Exclusive Events 

c. Implications/applications of Statistical Independence 

Summary: Review material covered, reemphasizing key points. Ask 

questions to determine effectiveness of instruction.

Lesson Plan No. S-V-5

Instructor:

Time: 100 min.

Date:

Subject Area: Probability and Its Applications to Antisubmarine Warfare

Topic: Random Variables

Reference: ASW Study Guide; Schaum's Outline Series, Theory and Problems of Probability

Title: "Random Variables"

Materials: Transparencies of Table 5, Table 6 and Figures 15 through 19

Objectives: Introduce the concept of random variables, and develop
random variables as a device for assigning names to the outcomes
of a probability experiment. The student should be
able to demonstrate how a random variable is used by means
of examples.

Introduction: Instructor should inform students of purpose of lesson
and briefly highlight important points to be covered.

Presentation: Discuss the following points: 

1. Random variable 

a. discrete 

b. continuous 

c. density function 

d. cumulative distribution function (CDF) 

e. mean

f. variance 

g. standard deviation 

Summary: Review material covered, reemphasizing key points. Ask 

questions to determine effectiveness of instruction.

Lesson Plan No. S-V-6

Instructor:

Time: 550 min.

Date:

Subject Area: Probability and Its Applications to Antisubmarine Warfare

Topic: Basic Statistical Distributions

Reference: ASW Study Guide; Schaum's Outline Series, Theory and Problems of Statistics; and Schaum's Outline Series, Theory and Problems of Probability

Title: "Basic Statistical Distributions" 

Materials: Transparencies of Table 7 and Figures 20 through 29 

Objectives: (1) Present examples of some useful statistical distributions.

(2) The student should be able to give a simple description
of each of the distributions presented, and compute
probabilities for Bernoulli, Binomial, Uniform, Chi-Square,
Exponential, Poisson and Normal examples
(with the exception of the Bivariate Normal and
Time-Varying Normal, which are presented only for student
appreciation of "state-of-the-art" applications).

(3) It is intended that this lesson serve to extend the
student's understanding of probability functions,
random variables, density functions and cumulative
distribution functions.



Introduction: 



Instructor should inform students of purpose of lesson 
and briefly highlight important points to be covered. 



Presentation: Discuss the following points: 

1. Background on the basics of specifying a statistical
distribution, drawing the majority of this information 
directly from the previous lesson plan. 

2. Bernoulli Distribution 

3. Binomial Distribution 

4. Uniform Distribution 

5. Normal Distributions 

a. Normal Distribution 

b. Bivariate Normal Distribution

c. Time-Varying Normal Distribution

6. Chi-Square Distribution

7. Exponential Distribution

8. Poisson Distribution

Summary: Review material covered, reemphasizing key points. Ask

questions to determine effectiveness of instruction.

STUDY GUIDE



PROBABILITY AND ITS APPLICATIONS 



TO ANTISUBMARINE WARFARE 



INTRODUCTION 



Probability theory originated in the seventeenth century with
investigations of various games of chance. Since then many scholars
have made contributions to the theory. Probability theory was finally
axiomatized in the 1920s and 1930s. This axiomatic
development, called Modern Probability Theory, provided for a precise
formulation of probabilistic concepts, which enabled the theory to be
placed on a firm mathematical foundation.

The importance of probability has increased significantly in the 
past twenty years. Today the notions of probability, and the closely 
related subject statistics, appear in almost every discipline. Probability 
is used routinely by many Naval Officers in the line of duty, and plays 
an important role in Antisubmarine Warfare, especially in the area of 
search and detection. 

This study guide is designed for an introductory course in probability 
and applications to antisubmarine warfare, with basic algebra as the only 
prerequisite. The guide may serve as a text for such a course, as lesson 
plans are also provided, or as a supplement. In addition, since this
study guide is complete and self-contained, it could be used for
self-study.

Chapters I, II, and III of the study guide cover sample data reduction,
frequency distributions, associated measures of central tendency and
dispersion, and combinatorial analysis. This leads naturally to
elementary probability theory, conditional probability, statistical
independence and applications, in Chapter IV. Chapter V covers random
variables. Discrete and continuous random variables are covered, with
emphasis on the use of tables to determine numerical probabilities.
Chapter VI covers the Bernoulli, Binomial, Uniform, Normal, Chi-square,
Exponential and Poisson distributions, which are commonly used in
antisubmarine warfare applications. Since developing understanding of
the applications of these distributions to ASW is the primary "payoff"
for this course, Chapter VI is significantly longer and has more
applications than the first five chapters.

Each chapter has clear statements of pertinent definitions and 
principles, together with illustrations and other descriptive material. 
This is followed by solved problems, which serve to illustrate and 
amplify the theory, bring into sharp focus those fine points without 
which the student feels himself on unsafe ground, and provide the
repetition of basic principles so vital to effective learning.

CHAPTER I: FREQUENCY DISTRIBUTION



POPULATION AND SAMPLE 

In collecting data on characteristics of a group of individuals 
or objects, such as heights of students in a university, or the number 
of defective and non-defective sonobuoys produced in a factory on a 
given day, it is often impossible or impractical to observe the entire 
group, especially if it is large. Instead of examining the entire 
group, called the population, one examines a small part of the group
called a sample. If a sample is representative of a population, important
conclusions about the population can be reached from analysis of the 
sample. Inferences based on sample data are not absolutely certain 
because in practice it is usually not known beforehand whether the 
sample is indeed representative of the population, so the term probability 
is used in describing conclusions. The term statistics is used in 
connection with conditions under which such an inference is made. 

RAW DATA 

Raw data are collected data which have not been organized numerically. 
An example is the set of heights of 100 students obtained from Naval 
Academy records. The heights of the 100 students, with repetitions 
included, are raw data which constitute a sample, and the population
from which the sample is taken is the set of heights of the students
from the entire listing of the Naval Academy records.

ARRAYS



An array is an arrangement of raw numerical data in ascending or 
descending order of magnitude. 
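
As a minimal sketch of forming an array, the Python fragment below sorts a short list of raw heights into ascending and descending order. The height values are hypothetical, chosen only for illustration; they are not taken from the Naval Academy records.

```python
# Hypothetical raw data: heights in inches, in the order they were recorded.
raw_heights = [67, 62, 71, 66, 69, 64, 67, 73]

# An array is the same data arranged in order of magnitude.
ascending_array = sorted(raw_heights)
descending_array = sorted(raw_heights, reverse=True)

print(ascending_array)
print(descending_array)
```

Repeated values (here 67) are kept in the array, just as repetitions are kept in the raw data.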

CLASSES OR CATEGORIES, CLASS FREQUENCY AND FREQUENCY TABLE 

In order to comprehend what masses of collected raw data really 
mean, and to draw conclusions from these data, it is useful to distribute 
the data into classes or categories. The number of individual data
points belonging to a class is called the class frequency. A tabular
arrangement of data by classes, together with the corresponding class
frequencies, is called a frequency table. Table 1 is the frequency table
of the heights (recorded to the nearest inch) of the 100 Naval Academy
students.



Height (inches)        Number of Students
(class)                (class frequency)

60 - 62                         5
63 - 65                        18
66 - 68                        42
69 - 71                        27
72 - 74                         8

                       Total  100

Table 1: Heights of 100 Naval Academy Students

CLASS INTERVALS AND CLASS LIMITS



The symbol defining a class, such as 60 - 62 in Table 1, is called a
class interval. The end numbers, 60 and 62, are called class limits;
the smaller number 60 is the lower class limit and the larger number 62
is the upper class limit. A class interval which has no upper, or no
lower, or neither an upper nor a lower class limit is called an open
class interval. For example, an open class interval is appropriate for
describing torpedoes with a range of "10,000 yards or greater."
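
The ideas of classes, class limits and class frequencies can be sketched in a few lines of Python. The class limits below are those of Table 1; the raw heights are hypothetical values used only to exercise the tally, and the function name frequency_table is an illustrative choice, not something defined in this guide.

```python
# Class limits of Table 1 (heights recorded to the nearest inch).
class_limits = [(60, 62), (63, 65), (66, 68), (69, 71), (72, 74)]

# Hypothetical raw heights, for illustration only.
raw_heights = [61, 64, 67, 67, 70, 66, 73, 68, 65, 69]


def frequency_table(data, limits):
    """Return the class frequency for each (lower, upper) class interval."""
    frequencies = [0] * len(limits)
    for value in data:
        for i, (lower, upper) in enumerate(limits):
            # Both class limits belong to the class interval.
            if lower <= value <= upper:
                frequencies[i] += 1
                break
    return frequencies


frequencies = frequency_table(raw_heights, class_limits)
for (lower, upper), f in zip(class_limits, frequencies):
    print(f"{lower} - {upper}: {f}")
```

Applied to the full set of 100 recorded heights, the same tally would reproduce the class frequencies of Table 1.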



CLASS BOUNDARIES 

In Table 1 the heights are recorded to the nearest inch, and the class 
interval 60 - 62 includes all measurements from 59.5000 to 62.5000 
inches. These numbers, indicated briefly by the exact numbers 59.5 and 
62.5, are called class boundaries; the smaller number 59.5 is the lower
class boundary and the larger number 62.5 is the upper class boundary.



SIZE OR WIDTH OF A CLASS INTERVAL 

The size or width of a class interval is the difference between the 
lower and upper class boundaries and is referred to as the class width,
class size, or class length. For the data in Table 1, for example, the
class width is 62.5 - 59.5 = 65.5 - 62.5 = 74.5 - 71.5 = 3. 
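
A short Python sketch of the last two sections follows. It assumes, as in Table 1, that data are recorded to the nearest whole unit, so each class boundary lies half a unit beyond the corresponding class limit; the function name class_boundaries is illustrative.

```python
# Class limits of Table 1 (heights recorded to the nearest inch).
class_limits = [(60, 62), (63, 65), (66, 68), (69, 71), (72, 74)]
unit = 1  # recording precision: one inch


def class_boundaries(limits, unit):
    """Extend each class limit by half the recording unit."""
    half = unit / 2
    return [(lower - half, upper + half) for lower, upper in limits]


boundaries = class_boundaries(class_limits, unit)

# Class width = upper class boundary - lower class boundary.
widths = [upper - lower for lower, upper in boundaries]

print(boundaries[0])
print(widths)
```

Note that the width comes out as 3, not 62 - 60 = 2, because the width is measured between class boundaries rather than class limits.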

CLASS MARK 

The class mark is the midpoint of the class interval; it is obtained 
by adding the lower and upper class limits and dividing by two. Thus the 
class mark of the interval 60 - 62 is (60 + 62)/2 = 61. The class mark 
is also called the class midpoint.

GENERAL RULES FOR FORMING FREQUENCY DISTRIBUTIONS



1. Determine the largest and smallest numbers in the raw data 
and thus find the range (the length of the interval from 
the smallest to the largest number). 

2. Divide the range into a convenient number of class intervals 
having the same size. The number of class intervals is 
usually taken between 5 and 20, depending on the data. Whenever
possible, class intervals are also chosen so that the
class marks or midpoints coincide with actually observed data. 
However, class boundaries usually do not coincide with 
actually observed data. 

3. Determine the number of observations falling into each class
interval; i.e., find the class frequencies.

As an example of a frequency distribution, consider the wages in dollars
per week paid to enlisted personnel aboard a particular destroyer.
Suppose the wages ranged from $50.00 to $120.00 per week
with 8 personnel receiving wages between $50.00 and $60.00,
10 between $60.00 and $70.00, 16 between $70.00 and $80.00,
14 between $80.00 and $90.00, 10 between $90.00 and $100.00,
5 between $100.00 and $110.00 and 2 between $110.00 and $120.00.
A possible formulation of the corresponding frequency distribution is
presented in Table 2 with the range from $50.00 to $119.99, in 7 ten
dollar class interval widths and the total number of personnel listed
for each class interval.

Wages (dollars)          Number of Enlisted Personnel
(classes)                (class frequencies)

$ 50.00 - $ 59.99                  8
$ 60.00 - $ 69.99                 10
$ 70.00 - $ 79.99                 16
$ 80.00 - $ 89.99                 14
$ 90.00 - $ 99.99                 10
$100.00 - $109.99                  5
$110.00 - $119.99                  2

                         Total    65

Table 2: Frequency Distribution of the Weekly Wages (in Dollars)
of 65 Enlisted Personnel Aboard a Destroyer

With reference to Table 2 determine:

1. The lower limit of the sixth class. Ans. $100.00

2. The upper limit of the fourth class. Ans. $89.99

3. The class mark of the third class.

Class mark of 3rd class = (1/2)($70.00 + $79.99) = $74.995
(For most practical purposes this is rounded to $75.00.)

4. The class boundaries of the fifth class.

Lower class boundary of 5th class = (1/2)($90.00 + $89.99) = $89.995
Upper class boundary of 5th class = (1/2)($99.99 + $100.00) = $99.995

5. The size of the fifth class interval.

Size of 5th class interval = upper boundary of 5th class - lower
boundary of 5th class = $99.995 - $89.995 = $10.00
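
The five answers above can be checked with a short Python sketch. It uses the standard decimal module so that amounts such as $74.995 are computed exactly; the variable names are illustrative, and the class limits and frequencies are those of Table 2.

```python
from decimal import Decimal

# Class limits of Table 2: $50.00 - $59.99, $60.00 - $69.99, ..., $110.00 - $119.99.
lower_limits = [Decimal(50 + 10 * k) for k in range(7)]
upper_limits = [low + Decimal("9.99") for low in lower_limits]
frequencies = [8, 10, 16, 14, 10, 5, 2]

total = sum(frequencies)

# Class numbers in the text are 1-based; Python indices are 0-based.
sixth_lower_limit = lower_limits[5]
fourth_upper_limit = upper_limits[3]

# Class mark: midpoint of the lower and upper class limits.
third_class_mark = (lower_limits[2] + upper_limits[2]) / 2

# Class boundary: midway between adjoining limits of neighboring classes.
fifth_lower_boundary = (upper_limits[3] + lower_limits[4]) / 2
fifth_upper_boundary = (upper_limits[4] + lower_limits[5]) / 2
fifth_class_size = fifth_upper_boundary - fifth_lower_boundary

print(total, sixth_lower_limit, fourth_upper_limit)
print(third_class_mark, fifth_lower_boundary, fifth_upper_boundary)
print(fifth_class_size)
```

Working in exact decimals avoids the small rounding discrepancies that binary floating-point arithmetic can introduce with amounts ending in .99.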



HISTOGRAMS AND FREQUENCY POLYGONS 



Histograms and frequency polygons are graphical representations of 
frequency distributions.

1. A histogram consists of a set of rectangles having:

(a) Bases on a horizontal axis (the X axis) with centers
at the class marks and lengths equal to the class sizes.

(b) Areas proportional to class frequencies.

2. A frequency polygon is a line graph of class frequency plotted 
against class mark. It can be obtained by connecting midpoints 
of the tops of the rectangles in the histogram. 

Figure 1 shows a histogram and frequency polygon developed from the data 
listed in Table 1. 
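
For readers without the figure at hand, the following Python sketch gives a rough text rendering of the same information, using the class marks and class frequencies of Table 1. The function name text_histogram and the choice of one "#" per student are illustrative assumptions; because all classes have the same width, bar length proportional to frequency also makes bar area proportional to frequency.

```python
# Class marks and class frequencies from Table 1.
class_marks = [61, 64, 67, 70, 73]
frequencies = [5, 18, 42, 27, 8]

# Frequency polygon: the points (class mark, class frequency) joined in order.
polygon_points = list(zip(class_marks, frequencies))


def text_histogram(marks, freqs):
    """Return one text bar per class, one '#' per observation."""
    return [f"{mark:>3} | {'#' * f} {f}" for mark, f in zip(marks, freqs)]


lines = text_histogram(class_marks, frequencies)
for line in lines:
    print(line)
print(polygon_points)
```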

RELATIVE FREQUENCY DISTRIBUTIONS 

The relative frequency of a class is the frequency of the class 
divided by the total frequency of all classes, and is generally 
expressed as a percentage. For example, the relative frequency of the 
class 66 - 68 in Table 1 is 42/100 = 42%. The sum of the relative 
frequencies of all classes is 1, or 100%. If the frequencies in Figure 1 
are replaced by corresponding relative frequencies, the figure is called 
a relative frequency distribution.
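
The computation of relative frequencies can be sketched in Python as follows, using the class frequencies of Table 1; the variable names are illustrative.

```python
# Classes and class frequencies from Table 1.
class_labels = ["60 - 62", "63 - 65", "66 - 68", "69 - 71", "72 - 74"]
frequencies = [5, 18, 42, 27, 8]

total = sum(frequencies)

# Relative frequency = class frequency / total frequency of all classes.
relative_frequencies = [f / total for f in frequencies]

for label, r in zip(class_labels, relative_frequencies):
    print(f"{label}: {r:.0%}")
```

The relative frequencies sum to 1 (up to floating-point rounding), in agreement with the statement above.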




Figure 1: Histogram and Frequency Polygon of Data
Listed in Table 1.

CUMULATIVE FREQUENCY DISTRIBUTIONS AND OGIVES



The total frequency of all values less than the upper class boundary 
of a given class interval is called the cumulative frequency up to and 
including that class interval. For example, the cumulative frequency 
up to and including the class interval 66 - 68 in Table 1 is 5 + 18 + 42 
= 65, which means that 65 students have heights less than 68.5 inches. 

A table presenting such cumulative frequencies is called a cumulative
frequency distribution, cumulative frequency table, or briefly a
cumulative distribution, and is shown in Table 3 for the Naval Academy
student height distribution. A graph showing the cumulative frequency
less than any upper class boundary plotted against the upper class
boundary is called a cumulative frequency polygon or ogive; an example
is shown in Figure 2 for the Naval Academy student height distribution.



Height (inches)          Number of Students

less than 59.5                    0
less than 62.5                    5
less than 65.5                   23
less than 68.5                   65
less than 71.5                   92
less than 74.5                  100

Table 3: Cumulative Frequency Distribution
of the Naval Academy Student Height
Distribution of Table 1

Figure 2: Ogive for the data of Table 3

RELATIVE CUMULATIVE FREQUENCY DISTRIBUTIONS 



The relative cumulative frequency or percentage cumulative frequency
is the cumulative frequency divided by the total frequency. For example, 
the relative cumulative frequency of heights of Naval Academy students 
with heights less than 68.5 inches is 65/100 = 65%, which indicates that 
65% of the students have heights less than 68.5 inches. 
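
Cumulative and relative cumulative frequencies are running totals, which the Python sketch below computes with itertools.accumulate from the class frequencies of Table 1. The variable names are illustrative.

```python
from itertools import accumulate

# Upper class boundaries and class frequencies from Table 1.
upper_boundaries = [62.5, 65.5, 68.5, 71.5, 74.5]
frequencies = [5, 18, 42, 27, 8]

# Cumulative frequency: running total of the class frequencies.
cumulative = list(accumulate(frequencies))
total = cumulative[-1]

# Relative cumulative frequency = cumulative frequency / total frequency.
relative_cumulative = [c / total for c in cumulative]

for boundary, c, rc in zip(upper_boundaries, cumulative, relative_cumulative):
    print(f"less than {boundary}: {c} ({rc:.0%})")
```

Plotting each cumulative frequency against its upper class boundary, starting from 0 at 59.5, gives the points of the ogive.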



TYPES OF FREQUENCY CURVES 

Many times relative frequency curves are smoothed (that is, a smooth
curve is "fitted" to a relative frequency polygon) and are used to
represent situations arising in ASW applications, such as probability of
detection. (These probability distributions are developed in Chapters IV
through VI.)

Some of the more common frequency curves arising in ASW applications
take on the characteristic shapes indicated in Figure 3.




Figure 3: Types of Frequency Curves (panels: Symmetrical or Bell-Shaped;
Skewed to the Left (Negative Skewness); Skewed to the Right (Positive
Skewness))

1. The symmetrical frequency curves are characterized by the fact 
that observations equidistant from the central maximum have the 
same frequency. An important example is the Normal curve shown 
in Chapter VI. 

2. In the skewed frequency curves the tail of the curve to one side
of the central maximum is longer than that to the other. If the
longer tail occurs to the right the curve is said to be skewed
to the right or to have positive skewness, while if the reverse
is true the curve is said to be skewed to the left or to have
negative skewness.

3. A U-shaped frequency curve has maxima at both ends.

4. A bimodal frequency curve has two maxima. 

5. A multimodal frequency curve has more than two maxima. 

Supplementary solved problems covering arrays, frequency distributions,
histograms, frequency polygons, cumulative frequency distributions, ogives,
and frequency curves are presented at the end of Chapter 2 of Schaum's
Outline Series, Theory and Problems of Statistics. These problems should
be consulted if a further review of the material is desired.
CHAPTER II: SPECIAL MEASURES OF SAMPLE DATA:

CENTRAL TENDENCY AND DISPERSION



MEASURES 

Although data relating to observations of events (for example, the
number of submarine detections in a given region during a specific
period of time) are important in developing methods for predicting
similar events in the future, the data in the raw state are usually not
directly suitable for making these predictions. The data must be
reduced to a format that is still more manageable than the frequency
histogram of Chapter I, both to facilitate further examination and to
reveal the pattern of events on which the data are based. Such a
representation may consist of "measures" of the data. Through the use
of these measures, an entire data set can, effectively, be summarized
by two or three numbers. As a result, these measures are very important
and useful devices by which sets of sample data can be quickly described.

Two particularly useful measures of sample data are those which
indicate or describe a relative central value of the sample data
(central tendency), and the degree to which the data are spread out
about a central value (dispersion). A measure of central tendency can be
utilized to indicate a "common quantity" which can be used to describe
roughly the entire sample (and possibly even the entire population).
A measure of dispersion can be used as an indicator of the relative
"error" or "spread" that exists when a measure of central tendency is
used to describe the sample or the population.

A digression will be made at this point to introduce two useful
forms of notation: index or subscript notation, and summation notation.

SUBSCRIPT NOTATION 

When it is necessary to describe a variable, X, and the values it
can assume, and also to distinguish between individual members of the
group (these members are specific values which the variable X can assume:
the 1st x, the 2nd x, etc.), subscript notation is a concise, useful means
of describing specific values of the variable. To describe the values
of the variable X with subscript notation, the first value (1st x) of
the group is denoted by $x_1$, the second value by $x_2$, and so forth to
the last, say the nth, value, denoted by $x_n$. The entire group of values
of the variable X can then be denoted by

    $X = (x_1, x_2, \ldots, x_n)$,

where the three dots indicate that the middle values are included in the
set but, for brevity, are not explicitly written down.



SUMMATION NOTATION 

Summation notation is a concise and useful means of denoting the sum
of all or part of the individual observed values of a variable X. The
notation used to denote the sum

    $x_1 + x_2 + x_3 + \ldots + x_n$

is

    $\sum_{i=1}^{n} x_i$ .

The capital Greek letter $\Sigma$ (sigma) denotes that a sum is to be
taken over the observed values of the variable $x_i$ which follow it, and
the values of $x_i$ that are to be summed are those with subscripts
between i = 1 and i = n. For example, to sum the third through the fifth
observed values of X, the summation is written as

    $\sum_{i=3}^{5} x_i$ .

That is,

    $\sum_{i=3}^{5} x_i = x_3 + x_4 + x_5$ .

This example shows that the lower limit on i need not be 1. Further, if
the values X can assume (say, a total of six values) are $x_1 = 0$,
$x_2$ = [illegible], $x_3 = 7$, $x_4 = 2$, $x_5 = 1$, and $x_6 = 9$, then

    $\sum_{i=3}^{5} x_i = 7 + 2 + 1 = 10$ .



The following example shows how this notation can be used in an ASW 
setting. Suppose there are a total of six regions each of which can be 
monitored by a single sensor, where the number of possible targets in 
each region is known for given periods of time (from intelligence). It 
is important for the search planner to know the number of potential 
detection opportunities available from the monitored regions so that he 
can ensure that sufficient localizing/tracking platforms (ships, aircraft,
submarines) are on standby for employment upon initial detection. As 
the decision to monitor regions three through five has been previously 
made, he wishes to specify the number of detection opportunities available 
in these regions on a given day in a standard format regardless of the 
particular values entered in the format for a specific period; this is 
because he wishes to include this calculation in a larger computer 
program. The notation

    $\sum_{i=3}^{5} x_i$

is a convenient format for this purpose. In any particular run of the
program, numerical values for $x_3$, $x_4$, and $x_5$ must be entered.

An important property of summation is that if a constant value, k,
is summed over a range of N indices, then the summation of this
quantity is equivalent to multiplying the constant k by N. For
example,

    $\sum_{i=2}^{5} k = k + k + k + k = 4k$ ,

since there are N = (5 - 2) + 1 = 4 terms.
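
The summation notation above translates directly into a small program of
the kind the search planner might embed in a larger one. The following
is a minimal sketch in Python, not part of the original material; the
function name sum_range and the constant k = 3 are illustrative
assumptions, and only the observed values with subscripts 3 through 5
from the numerical example are used.

```python
def sum_range(values, lower, upper):
    """Return the sum of values[i] for i = lower, ..., upper (inclusive)."""
    return sum(values[i] for i in range(lower, upper + 1))

# Observed values keyed by subscript (only the subscripts needed here).
x = {3: 7, 4: 2, 5: 1}
print(sum_range(x, 3, 5))  # sum of x_3, x_4, x_5

# Summing a constant k over i = 2, ..., 5 gives N * k,
# where N = (5 - 2) + 1 = 4 terms.
k = 3  # arbitrary constant chosen for illustration
constant_terms = {i: k for i in range(2, 6)}
print(sum_range(constant_terms, 2, 5) == 4 * k)
```

Keying the values by subscript keeps the code in step with the
1-based subscripts used in the text.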



MEAN, MEDIAN, AND MODE 

Three particularly useful measures of central tendency are the mean,
the median, and the mode.

The mean, which is similar to the commonly used notion of arithmetic 
average, is computed by adding the observed values of sample data and 
dividing this sum by the total number (N) of data points in the sample. 

The mean can be expressed in summation notation as

    $\bar{X} = \frac{1}{N} \sum_{i=1}^{N} x_i$ ,

where N is the total number of observed values summed, and $\bar{X}$ is
read as "the mean value of X." (Note that the upper limit on the summation
index equals the reciprocal of the constant in front of the summation.)
For example, the mean of the observed values $x_1 = 2$, $x_2 = 5$, and
$x_3 = 1$ of X is

    $\bar{X} = \frac{1}{3} \sum_{i=1}^{3} x_i = \frac{1}{3}(2 + 5 + 1) = \frac{8}{3}$ .






The utility of this value ($\bar{X}$) for X is that it can be taken, in
many cases, to state a "middle value" of the observed data, and is often
used as the best single-number estimate of the values of X.

For example, suppose it is necessary to search for an enemy submarine 
which is currently maintaining a "known" missile launching station, 
and the following conditions exist: 

1. the "known" missile launch station is actually a large ocean 
region; 

2. the area that can be searched effectively is one-half the size of 
the missile station; 

3. the positions of previous enemy submarine detections within the 
missile launch area are available to the search planner. 

The above information is depicted in Figure 4a below, where the dots 
represent the previous detection positions. 



Figure 4: Submarine Search Information:

a. Positions of Previous Detections (Missile Launch Station);

b. Mean Search Area (Searched Area, Unsearched Area, Mean Position)

It can be seen from Figure 4a that the positions tend to cluster about a 
certain small enclosed portion of the launch region; therefore, it seems 
reasonable to search about this small area in order to increase efficiency. 
The center of this small region is the mean position of previous enemy 
submarine contacts. Therefore, to obtain the position about which the 
search should be centered, the mean of the previous positions is calculated. 
The search is subsequently centered about this mean position, as seen in 
Figure 4b. 
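
The mean position can be computed by averaging each coordinate of the
previous detection positions separately. The following is a minimal
sketch in Python under stated assumptions: the detection coordinates are
hypothetical (x, y) offsets in nautical miles invented for illustration,
and the function name mean_position is not from the original text.

```python
from statistics import fmean

# Hypothetical previous detection positions, as (x, y) offsets in n.m.
detections = [(10.0, 4.0), (12.0, 6.0), (11.0, 5.0), (13.0, 5.0), (9.0, 5.0)]

def mean_position(points):
    """Return the coordinate-wise mean of a list of (x, y) positions."""
    xs, ys = zip(*points)
    return fmean(xs), fmean(ys)

# The search would be centered on this position.
center = mean_position(detections)
print(center)
```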

A special form of the mean is the weighted (arithmetic) mean. This
form allows one to account for differing "levels of importance" of the
individual sample observations. For example, if in a three-element sample,
$x_1$ were considered to be twice as important as $x_2$, and $x_3$ four
times as important as $x_2$, the weighted mean of the sample would be
computed as

    $\bar{X} = \frac{m_1 x_1 + m_2 x_2 + m_3 x_3}{m_1 + m_2 + m_3} = \frac{2x_1 + 1x_2 + 4x_3}{2 + 1 + 4}$ ,

where $m_i$ is the relative weight, or level of importance, assigned
to the ith element.
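
The weighted mean can be sketched in a few lines of Python. This is an
illustration only: the weights 2, 1, and 4 are those of the example
above, while the observed values 3, 10, and 5 and the function name
weighted_mean are hypothetical.

```python
from fractions import Fraction

def weighted_mean(values, weights):
    """Return sum(m_i * x_i) / sum(m_i) as an exact fraction."""
    if len(values) != len(weights):
        raise ValueError("values and weights must have the same length")
    weighted_sum = sum(m * x for m, x in zip(weights, values))
    return Fraction(weighted_sum, sum(weights))

# Weights from the text (2, 1, 4); the observed values are hypothetical.
print(weighted_mean([3, 10, 5], [2, 1, 4]))

# With equal weights the weighted mean reduces to the ordinary mean.
print(weighted_mean([2, 5, 1], [1, 1, 1]))
```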

The median value is the value which exactly divides the sample into
two equal parts; there are exactly as many observations greater than the
median value as there are less. When the number of observations is even,
the median is exactly half-way between the two "center" observations, but
for an odd number of observations the median value is the center
observation.

For example, the sample

    $x_1 = 2$, $x_2 = 5$, $x_3 = 1$, $x_4 = 7$

when arrayed is 1, 2, 5, 7 and has as its median

    $\frac{2 + 5}{2} = 3.5$ ,

and we note that this is not one of the values actually observed.

The mode or modal value of the sample is the most frequently occurring
value (or values) in the sample. If more than one value occurs at this
same maximum frequency (of occurrence), then the sample is said to be
multimodal; such a sample would have a frequency curve with more than one
peak. For example, for the sample $x_1 = 2$, $x_2 = 5$, $x_3 = 2$, and
$x_4 = 3$, the mode is 2, since it occurs twice in the sample, while all
other observed values appear once. Although the mode is easy to ascertain
from a sample, it is probably the least useful of the measures of central
tendency treated here.
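
The three measures of central tendency can be checked against the
numerical examples above with the Python standard library. This is a
minimal sketch for illustration; the variable names are assumptions, and
exact fractions are used so that the mean appears as 8/3.

```python
from fractions import Fraction
from statistics import mean, median, multimode

# Mean of the sample 2, 5, 1 (kept exact by using fractions).
sample_mean = mean([Fraction(2), Fraction(5), Fraction(1)])

# Median of the sample 2, 5, 1, 7: halfway between the two center values.
sample_median = median([2, 5, 1, 7])

# Mode(s) of the sample 2, 5, 2, 3; multimode returns every value that
# occurs at the maximum frequency, so a multimodal sample gives several.
sample_modes = multimode([2, 5, 2, 3])

print(sample_mean, sample_median, sample_modes)
```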
It is seen that each of the three measures of central tendency
mentioned can be used as an estimate of (or statistic which describes)
the sampled population; however, the mean is probably the most useful
of the three, because it uses every observed value and lends itself to
further computation (for example, of the standard deviation and
variance). It should be noted that the mean is influenced by extreme
sample values to a greater extent than the median and mode are, so that
for a sample containing a few extreme values the median may describe the
central tendency better. For most practical applications, however, the
mean of a sample is used to reflect the central tendency of the parent
population.

RANGE, STANDARD DEVIATION AND VARIANCE --MEASURES OF DISPERSION 

A useful statistical indicator for sample data and its parent
population is the spread of the sample points (usually expressed relative
to, or about, the mean). Such an indicator is called a measure of
dispersion (spread).

Three particularly useful measures of dispersion are the range, the
standard deviation, and the variance.

The range is obtained by taking the difference between the largest 
and the smallest observed value in the sample. While the range of 
sampled data is easy to compute, it is not as accurate an indicator as 
the other two mentioned. This is due to the large degree to which one or 
two sample values (a low frequency of occurrence) at relatively extreme 
distances from the center of the greatest portion of the sample can bias 
the estimate of the actual spread (or dispersion) of the sampled 
population. 

The standard deviation $\sigma$ (lower case Greek sigma) is obtained by
first computing the sample mean, and then using the formula

    $\sigma = \sqrt{\frac{1}{N} \sum_{i=1}^{N} (x_i - \bar{X})^2}$ .

Since the standard deviation tends to weight each observed value by its 

frequency of occurrence, it is seen to be a more accurate measure of 

dispersion than the range; that is, the standard deviation is not unduly 

biased by a few extreme sample values. 

The variance is found by simply squaring the standard deviation.
Consequently, the variance has all the advantages of the standard
deviation. The mathematical form for the variance is

    $\sigma^2 = \frac{1}{N} \sum_{i=1}^{N} (x_i - \bar{X})^2$ .

An equivalent form for the variance is

    $\sigma^2 = \overline{X^2} - \bar{X}^2$ ,

where

    $\overline{X^2} = \frac{1}{N} \sum_{i=1}^{N} x_i^2$  and  $\bar{X}^2 = \left(\frac{1}{N} \sum_{i=1}^{N} x_i\right)^2$ .

The latter equation is the "short method" for computing $\sigma^2$ and is
the recommended method of computing the variance; the positive square root
of this equation is the "short method" for computing $\sigma$ and is
recommended for computing the standard deviation.

As an example which illustrates the equivalence of the two listed
forms for computing $\sigma^2$, consider the case where three BT
(bathythermograph) readings are observed and the first shows a sea
surface temperature of 50°F, the second 48°F, and the third 52°F, so
that $\bar{X} = 50$. The variance for these readings is first found by

    $\sigma^2 = \frac{1}{N} \sum_{i=1}^{N} (x_i - \bar{X})^2 = \frac{1}{3} \sum_{i=1}^{3} (x_i - \bar{X})^2$

    $= \frac{1}{3} [(50 - \bar{X})^2 + (48 - \bar{X})^2 + (52 - \bar{X})^2]$

    $= \frac{1}{3} (0 + 4 + 4) = \frac{8}{3}$ .

Now, the variance is found by

    $\sigma^2 = \overline{X^2} - \bar{X}^2$ ,

where

    $\overline{X^2} = \frac{1}{3} \sum_{i=1}^{3} x_i^2 = \frac{1}{3} (50^2 + 48^2 + 52^2) = \frac{7508}{3}$

and

    $\bar{X}^2 = \left(\frac{1}{3} \sum_{i=1}^{3} x_i\right)^2 = \left[\frac{1}{3} (50 + 48 + 52)\right]^2 = 50^2 = \frac{7500}{3}$ ,

so that

    $\sigma^2 = \frac{7508}{3} - \frac{7500}{3} = \frac{8}{3}$ .

(The second method looks longer than the first, but the mean had to be
calculated prior to the first method.) Hence, this example illustrates
the equivalence of the two formulas for $\sigma^2$.
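
The equivalence of the two forms can also be verified numerically. The
following is a minimal sketch in Python using exact fractions; the
function names variance_definition and variance_short are illustrative
assumptions, and the readings are the three BT temperatures above.

```python
from fractions import Fraction
from math import sqrt

def variance_definition(xs):
    """Variance as the mean squared deviation from the mean."""
    n = len(xs)
    mean = Fraction(sum(xs), n)
    return sum((x - mean) ** 2 for x in xs) / n

def variance_short(xs):
    """Variance by the short method: mean of squares minus square of mean."""
    n = len(xs)
    mean_of_squares = Fraction(sum(x * x for x in xs), n)
    square_of_mean = Fraction(sum(xs), n) ** 2
    return mean_of_squares - square_of_mean

readings = [50, 48, 52]  # sea surface temperatures in degrees F
print(variance_definition(readings), variance_short(readings))

# The standard deviation is the positive square root of the variance.
print(sqrt(variance_short(readings)))
```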

As an example of the usefulness of the standard deviation, consider
the Normal probability frequency distribution (which is covered in detail
in Chapter VI) shown in Figure 5. If sigma (the standard deviation) is
known for the distribution, the percentage of cases in which any (observed)
value could be expected to have come from the intervals (of the
distribution) $\bar{X} - \sigma$ to $\bar{X} + \sigma$,
$\bar{X} - 2\sigma$ to $\bar{X} + 2\sigma$, or $\bar{X} - 3\sigma$ to
$\bar{X} + 3\sigma$ is 68.27%, 95.45%, or 99.73%, respectively, as shown.

Figure 5: Normal Probability Frequency Distributions
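
The three percentages quoted for the Normal distribution can be
reproduced from the error function, since the fraction of a Normal
population lying within k standard deviations of the mean is
erf(k / sqrt(2)). The following is a minimal sketch in Python; the
function name fraction_within is an illustrative assumption.

```python
from math import erf, sqrt

def fraction_within(k):
    """Fraction of a Normal population within k standard deviations of the mean."""
    return erf(k / sqrt(2))

for k in (1, 2, 3):
    print(f"within {k} sigma: {100 * fraction_within(k):.2f}%")
```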



As an application, consider a situation where it is necessary to
obtain the probability of detecting a submarine at a distance of 45 n.m.
from the detection device (which is denoted by $P_{det}$(45 n.m.)), given
the FOM (Figure-of-Merit), the pertinent PLP (Propagation Loss Profile),
and the $\sigma$-value for the probability of detection.

The first step is to visualize a Normal probability curve whose
"x-axis" is located vertically along the 45 n.m. line of the PLP, and
whose mean value is on the FOM line along the "x-axis," as seen in
Figure 6.




Figure 6: PLP (Propagation Loss Profile)




Next, recall that the area under the Normal curve to either side of the
FOM (the mean value for the distribution) corresponds to 50% probability
of detection. Note that the $P_{det}$(45 n.m.) is represented by the
darkened area under the Normal curve and under the PLP curve at 45 n.m.,
and that the intersection of the PLP and the axis of the probability curve
is at -95 dB or $-1\sigma$ relative to the mean (the FOM). Now the
$P_{det}$(45 n.m.) is calculated as the difference between 50% (the lower
half of the area under the Normal curve) and one-half of the area (under
the Normal curve) from $-1\sigma$ to $+1\sigma$ (which was previously
shown to be 68.27%), or

    $P_{det}$(45 n.m.) $= [50 - \frac{1}{2}(68.27)]\% = 15.87\%$ .

(Other methods for obtaining probabilities are presented in Chapters IV
through VI.)
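
The same probability can be obtained from the cumulative Normal
distribution. The following is a minimal sketch in Python, assuming (as
in the example above) that the PLP crosses the axis of the probability
curve exactly one standard deviation below the FOM; the variable names
are illustrative.

```python
from statistics import NormalDist

# Standard Normal curve: mean at the FOM, one unit equal to one sigma.
standard_normal = NormalDist(mu=0.0, sigma=1.0)

# Area below -1 sigma, read here as the probability of detection at 45 n.m.
p_det_cdf = standard_normal.cdf(-1.0)

# The same quantity by the method of the text: 50% minus half of 68.27%.
p_det_text = (50 - 0.5 * 68.27) / 100

print(f"{100 * p_det_cdf:.2f}%  {100 * p_det_text:.3f}%")
```

If the PLP crossed the axis at some other number of standard deviations
from the FOM, the argument of cdf would change accordingly.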

Supplementary solved problems covering special measures of sample 
data are presented at the end of Chapters 3 and 4 of Schaum's Outline 
Series, Theory and Problems of Statistics . These problems should be 
consulted for a further review of the material. 
CHAPTER III: COMBINATORIAL ANALYSIS



COMBINATORIAL ANALYSIS 

Combinatorial analysis involves application of counting principles 
to obtain probabilities of complex events. These basic principles are 
presented in this chapter. 

THE FUNDAMENTAL COUNTING PRINCIPLE 

If an event can happen in any of $n_1$ ways, and if when this has occurred
another event can happen in any of $n_2$ ways, then the number of ways in
which both events can happen is $n_1 n_2$. Example: if any one of 3
submarines is to transit any one of 2 barriers, then the number of ways
in which a submarine and a barrier can be paired is (3)(2) = 6 ways; with
5 submarines and 3 barriers there would be (5)(3) = 15 ways.
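
The counting principle can be illustrated by listing the pairings
explicitly. The following is a minimal sketch in Python which reads the
example as the choice of one submarine followed by the choice of one
barrier; the submarine and barrier labels are hypothetical.

```python
from itertools import product

submarines = ["SS-A", "SS-B", "SS-C"]   # hypothetical labels, n1 = 3
barriers = ["barrier 1", "barrier 2"]   # hypothetical labels, n2 = 2

# Every (submarine, barrier) pairing: n1 * n2 of them.
pairings = list(product(submarines, barriers))
print(len(pairings))

# The second case in the text: 5 submarines and 3 barriers.
larger_case = list(product(range(5), range(3)))
print(len(larger_case))
```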

FACTORIAL n 

Factorial n, denoted by n!, is equal to n(n - 1)(n - 2)...1. Thus
5! = (5)(4)(3)(2)(1) = 120, and 4! 3! = ((4)(3)(2)(1))((3)(2)(1)) = 144.
It is convenient to define 0! = 1. Further, n must be a nonnegative
integer.

PERMUTATION PRINCIPLE 

A permutation is an arrangement of r out of n objects with attention
given to the order of arrangement. The number of permutations of n objects
taken r at a time is denoted by nPr, P(n,r) or Pn,r and is given by

    nPr = n(n - 1)(n - 2)...(n - r + 1) = n! / (n - r)!

For example, how many ways may 5 destroyers be placed in a 5-position
screen? The first position can be occupied by any one of the 5 destroyers;
that is, there are 5 ways of filling the first position. When this has
been done there are 4 ways of filling the second position. Then there
are 3 ways of filling the third position, 2 ways of filling the fourth
position and 1 way of filling the last position. Therefore, n represents
the 5 destroyers and r represents the number of destroyers that may be
used in the screen, which also is 5; hence,

    nPr = 5P5 = n! / (n - r)! = 5! / (5 - 5)! = 5! / 0! = 5! / 1
        = (5)(4)(3)(2)(1) = 120

Now suppose 5 destroyers are available for placement in a 3 position 
screen. How many ways may the destroyers be placed in the screen? The 
first position can be occupied by any one of the 5 destroyers, then the 
second position can be occupied by any of the remaining 4 destroyers and 
finally the last position may be occupied by any of the remaining 3 
destroyers. Therefore, n represents the number of destroyers which is 5, 
and r represents the number of destroyers that may be used which is 3; so 
that, 

    nPr = 5P3 = n! / (n - r)! = 5! / (5 - 3)! = 5! / 2!
        = (5)(4)(3)(2)(1) / ((2)(1)) = 120 / 2 = 60.
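
Both destroyer-screen results can be confirmed with the Python standard
library, either from the formula or by enumerating the arrangements. The
following is a minimal sketch; the destroyer labels are hypothetical.

```python
from itertools import permutations
from math import factorial, perm

destroyers = ["DD1", "DD2", "DD3", "DD4", "DD5"]  # hypothetical labels

# 5 destroyers in a 5-position screen, and in a 3-position screen.
full_screen = perm(5, 5)
three_position_screen = perm(5, 3)

# The same count from the formula n! / (n - r)!.
by_formula = factorial(5) // factorial(5 - 3)

# And by listing every ordered arrangement of 3 of the 5 destroyers.
by_enumeration = sum(1 for _ in permutations(destroyers, 3))

print(full_screen, three_position_screen, by_formula, by_enumeration)
```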

COMBINATION PRINCIPLE 

A combination is a selection of r out of n objects without counting
the order of arrangement. The number of combinations of n objects taken
r at a time is denoted by nCr, C(n,r), Cn,r or $\binom{n}{r}$ and is
given by

    nCr = n(n - 1)...(n - r + 1) / r! = n! / (r! (n - r)!) = nPr / r!

For example: Suppose a submarine with six torpedoes approaches a
convoy of 10 ships. If the ships are selected at random and one torpedo
is shot at each ship, how many ways may the torpedoes be shot at the
10 ships?

    nCr = n! / (r! (n - r)!) = 10! / (6! (10 - 6)!)

        = (10)(9)(8)(7)(6)(5)(4)(3)(2)(1) / (((6)(5)(4)(3)(2)(1))((4)(3)(2)(1)))

        = 3,628,800 / ((720)(24)) = 3,628,800 / 17,280 = 210 ways

Now suppose 3 of the ships are warships protecting the convoy and it is
desired to keep clear of these ships. If the remaining 7 ships are
selected at random and one torpedo is shot at each ship, how many ways
may the torpedoes be shot at the 7 ships?

    nCr = n! / (r! (n - r)!) = 7! / (6! (7 - 6)!) = 7 ways
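
The two torpedo examples, and the relation nCr = nPr / r!, can be
checked with the Python standard library. The following is a minimal
sketch; the variable names are illustrative.

```python
from math import comb, factorial, perm

# Six torpedoes, one per ship, against a convoy of 10 ships.
ways_ten_ships = comb(10, 6)

# Six torpedoes against the 7 ships that are not warships.
ways_seven_ships = comb(7, 6)

# The relation nCr = nPr / r! for the first case.
from_permutations = perm(10, 6) // factorial(6)

print(ways_ten_ships, ways_seven_ships, from_permutations)
```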

Supplementary solved problems covering permutations and combinations
are presented at the end of Chapter VI of Schaum's Outline Series,
Theory and Problems of Statistics. These problems should be consulted
for further review of the material.
CHAPTER IV: ELEMENTARY PROBABILITY THEORY



ELEMENTARY PROBABILITY THEORY 

The concept of probability is commonly used as a means with which to 
state (or predict) the percentage of possible cases in which a specific 
event will occur; for example, the percentage of events in which a specific 
number will appear on a single toss of a die. The probability of such an 
event occurring is, in many cases, given as the ratio of the number of 
ways in which the event can occur to the number of ways in which ALL 
events can occur. So the probability that a two, for example, will occur 
on a single toss of a fair die is 

    P(2) = 1/6 ,

where the numerator represents the one way in which a two can occur, and
the denominator represents the six ways in which each of the numbers
1, 2, 3, 4, 5, 6 can occur; hence, the probability that a two will be
the outcome of a single toss of a die is one-sixth, or in other words,
the "chances" of this outcome occurring (an event) are "one in six."

Another method of specifying probability of an event is to state 
the probability in terms of proportional areas . The basis of this method 
is to think of the probability of occurrence of ALL events as an area, 
say "A," and the probability of a particular event is simply the ratio 
of the areas 

    P(event of interest) = B / A ,

where B is the area corresponding to the event of interest.

Probabilities specified as proportional areas can be represented by
diagrams of the corresponding regions. A diagram of two regions with
areas A and B is shown in Figure 7.




Figure 7: Proportional Areas for Probability Analogy

This figure facilitates understanding the probability of event B by
illustrating the area B which is assigned to a region relative to the
total area, "A," assigned to the occurrence of ALL events. Since the
diagram visually represents the probability of the event, it is a useful
means of intuitively developing the concept of probability. (A carefully
cultured intuition can be very useful when dealing with probability.)

It is necessary here to define some terms which will make the
formulation of basic probability theory easier. The first of these terms
is PROBABILITY EXPERIMENT, which is defined as a type of trial procedure
for selecting a specific outcome. An example is that of tossing a die
to see what number comes up and noting the occurrences (or of flipping
a coin to see whether "heads" or "tails" occurs).

Next, a SAMPLE SPACE is the totality of outcomes which can occur
for the probability experiment. In terms of plane regions, the sample
space is the large region ("A" in the previous example) which contains
all possible outcomes of a specific probability experiment.
An OUTCOME is a specific value (or set of values) resulting from a
single iteration of a probability experiment, and it is the basic
element (or smallest unit) of the sample space (for the specific
probability experiment). An example would be the occurrence of one
specific permutation (of the 216 possible permutations) of the numbers
appearing on the three dice in the (probability) experiment of tossing
three dice on a single toss and noting the numbers which appear, where
the goal of the experiment is to see whether or not a one, a two, and a
three appear on the toss. This can be represented as one of the totality
of points that make up the sample space (a three dimensional space, as
each outcome is composed of three numbers).

An EVENT, on the other hand, is a set of outcomes ; in the previous 
example, an outcome is the sequence of the three specific numbers which 
occur as a result of the toss of the three dice, and the event consists 
of six points of interest (outcomes) in 3-space, as shown in Table 4. 



    Die #1     Die #2     Die #3

    (  1          2          3  )
    (  1          3          2  )
    (  2          1          3  )
    (  2          3          1  )
    (  3          1          2  )
    (  3          2          1  )

Table 4: The Six Outcomes That Comprise (Satisfy)
the Event of Obtaining a One, a Two, and
a Three on a Single Toss of Three Dice
In terms of the proportional areas analogy, an event is an area ("B" in
the earlier example) which contains one or more outcomes and is a
sub-area of the sample space ("A"). An extension of this terminology is
the term JOINT EVENTS, which describes a combined event in which two or
more simple events occur on the same iteration of the experiment.

In summary, the sample space is comprised of all possible outcomes 
of a probability experiment, and these outcomes are grouped into sets 
called events--outcomes are the elements of an event. 
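
The sample space, outcomes, and event of the three-dice experiment can
be enumerated directly. The following is a minimal sketch in Python,
assuming fair dice so that all outcomes are equally likely; the variable
names are illustrative.

```python
from fractions import Fraction
from itertools import product

# Sample space: every ordered triple (die 1, die 2, die 3).
sample_space = list(product(range(1, 7), repeat=3))

# Event: a one, a two, and a three appear (in any order) on the toss.
event = [outcome for outcome in sample_space if sorted(outcome) == [1, 2, 3]]

# With equally likely outcomes, the probability of the event is the ratio
# of the number of outcomes in the event to the number in the sample space.
probability = Fraction(len(event), len(sample_space))

print(len(sample_space), event, probability)
```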

The remaining basic concept in the theory is that of understanding 
the PROBABILITY FUNCTION. A probability function is a device by which 
a number (or area) in the sample space is assigned (as previously 
mentioned) to an event. An example of a probability function is the 
assignment of the value 1/3 to the probability of each of the events 
that a single submarine, during a single transit, will cross one of 
three barriers each of which the submarine is equally-likely to cross. 
This is written as 

P(SS transits #1) = 1/3 

P(SS transits #2) = 1/3 

P(SS transits #3) = 1/3 , 

which can also be represented in an area diagram as in Figure 8, where it 

is seen that all three areas are equal and that no area overlaps any 
other, which is interpreted to mean that the submarine can cross only 
one of the barriers during a single transit. 
Figure 8: Proportional Area Depiction of Equal-Likelihood Probability
Barriers (Area 1 = 1/3, Area 2 = 1/3, Area 3 = 1/3)



There are basic restrictions on the probability function. One of
these is that the values assigned by the probability function to
non-overlapping events that together make up the entire sample space
(such as the three barrier events above) must sum to one, or

    $\sum_{i=1}^{n} p_i = 1$ ,

where $p_i$ is the value assigned to the ith event.

Another is that the values assigned must be between zero and one, or

    $0 \le p_i \le 1$ .

This restriction is a result of the fact that probability is a ratio of
a small area to a larger one. Also, since a specific event in a given
experiment will either occur or not occur, the sum of the probabilities
that the event will and the event will not occur must equal one. This is
represented mathematically as

    P(Specific event occurs) + P(Specific event does not occur) = 1

or by rearrangement

    P(Specific event does not occur) = 1 - P(Specific event occurs)

or

    P(Specific event occurs) = 1 - P(Specific event does not occur) .
As seen by the last two arrangements of the first equation, the probability
of an event occurring and the probability of the same event not occurring
on the same trial of an experiment are complements of each other in the
probability sense; that is, the probability of an event occurring can
also be described by one minus the probability of the same event not
occurring. As will be seen later, this idea of PROBABILITY COMPLEMENT
of the occurrence of an event has many useful computational applications.

We now pass from probabilities for single events to probabilities
for several events. There are two cases of interest: the probability
of joint events, and the probability of at least one of two (or more)
events. Joint events are denoted by the word "and"; for example, the
probability of events A and B occurring. The other case is denoted by
the word "or"; for example, the probability of event A or B occurring,
where "or" is used in this course in the sense of A or B or both. This
"or" probability is shown by the proportional areas analogy as in
Figure 9.




Figure 9: Probability of Events A "or" B

It is seen from this figure that the probability of event A or B is 
represented as the probability of A plus the probability of B, minus 
the probability of A and B occurring. The reason for subtracting the 
joint probability of A and B is that the area corresponding to the joint 
probability ("cross-hatched" in the figure) is common to both event A and 
event B, so that when area A is added to area B the "cross-hatched" area
is added twice; therefore, the joint probability must subsequently be
subtracted once. The probability of A or B occurring is therefore expressed
mathematically as

    P(A or B) = P(A + B) = P(A) + P(B) - P(A and B)

              = P(A) + P(B) - P(A · B).
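
The "or" equation can be checked by counting outcomes on a single toss
of a fair die. The following is a minimal sketch in Python; the events
A (an even number) and B (a number greater than three) are hypothetical
events chosen for illustration, as is the helper function prob.

```python
from fractions import Fraction

sample_space = {1, 2, 3, 4, 5, 6}   # single toss of a fair die
A = {2, 4, 6}                       # hypothetical event: even number
B = {4, 5, 6}                       # hypothetical event: greater than three

def prob(event):
    """Probability as (outcomes in the event) / (outcomes in the sample space)."""
    return Fraction(len(event & sample_space), len(sample_space))

p_or = prob(A | B)     # A or B (or both)
p_and = prob(A & B)    # joint event: A and B

# P(A or B) = P(A) + P(B) - P(A and B)
print(p_or, prob(A) + prob(B) - p_and)
```

The outcomes 4 and 6 lie in both events; adding P(A) and P(B) counts
them twice, which is why the joint probability is subtracted once.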

Figure 10 shows a case in which events A and B have no area of overlap; 
that is, A and B do not have any area in common. 




Figure 10: Probability of Events A "or" B when
Events A and B are Mutually Exclusive

It is easily seen from this figure that any outcome in event A cannot 
possibly also be a part of event B; in fact, the existence of the outcome 
in either A or B precludes its simultaneous existence in the other event. 
Two such events are said to be MUTUALLY EXCLUSIVE. 

The joint probability of A and B can be shown through the proportional 
areas analogy as in Figure 11. 




Figure 11: The Joint Probability of Events A and B

It is seen from the figure that the probability of the joint event is
the ratio of the "cross-hatched" area (the joint event itself) to the
entire area (which is equal to one since it is a probability space).
Mathematically the joint probability of events A and B is represented as

    P(A and B) = P(A · B) = P(A) + P(B) - P(A + B) .

(This is simply a rearrangement of the "or" probability equation developed 
previously.) 

The symbol P(B|A) denotes the CONDITIONAL PROBABILITY of event B
given that event A has already occurred; that is, it is the probability
of B on the condition that event A has occurred previously. It is seen
from Figure 12 that P(B|A) differs from P(B) in that the sample space (S)
from which B is drawn is larger than the reduced sample space (S') from
which B is drawn when A is known to have occurred (the event "B given A").
This is because S is reduced to S' once it is known that the event yet to
occur (B) is contained in the event that has already occurred (A);
therefore, in terms of the ratio idea, the denominator has been decreased,
thereby increasing the probability of B for the conditional case (relative
to the unconditioned case).




Figure 12: Comparison of Unconditioned Probability to
Conditional (or Conditioned) Probability

CONDITIONAL PROBABILITY actually epitomizes the usefulness of probability 
to ASW. That is, generally, an area is searched for the presence of a 
submarine, and both the cases in which the submarine 1) has or 2) has not
been detected as a result of previous search efforts are important
information. Each allows the searcher to refine his prediction of the
target's position (in most cases) to a smaller and smaller portion of
the original search region and, thereby, increase the probability of 
detecting the submarine on subsequent portions of the search. It is 
obvious how this occurs in the case where the target is detected on a 
given previous effort, but the case of no detection on previous search 
efforts may not be so obvious. The fact that the submarine was not 
detected on previous portions of the search allows the searcher to 
eliminate (again, in most cases) that portion which has been searched
(similar in idea to "sterilization"), thereby conditioning the original
sample space (this should be visualized as multiple iterations of the 
process depicted in Figure 12). Having previous knowledge (intelligence 
data) which allows the searcher to eliminate portions of the (original) 
search area is another way in which this conditioning can occur. 

The joint probability of events A and B (discussed previously) can be 
represented as 

P(A and B) = P(B|A) • P(A) .

This equation has more intuitive appeal than the previous equation for
joint probability, as it is simply an extension of the "Counting Principle";
it indicates that the probability of both event A and B occurring is the
probability that event A occurs and "event" B given A occurs.

From the last equation it is seen that if P(A) ≠ 0, then the
conditional probability of B given A is mathematically represented as

P(B|A) = P(A • B) / P(A) .

Three important methods for computing conditional probabilities are:

1. By utilizing the definition of conditional probability

P(B|A) = P(A • B) / P(A) .

For example, if it is known (by some means or another) that a submarine
skipper will not choose one of three known and equally likely barriers
to transit, then the probability of each remaining barrier is increased
beyond that for the original choice, because the sample space drawn on
has been reduced from three possible paths to two. If the barrier
eliminated is #1, then the resulting sample space S' consists of
barriers #2 and #3, vice the original sample space S which was made up
of barriers #1, #2, and #3. Therefore, the probability that the
submarine will transit barrier #2, given that it will transit either #2
or #3, is seen to be

$$P(B|A) = \frac{P(A \cdot B)}{P(A)} = \frac{P[(\#2 \text{ or } \#3) \text{ and } (\#2)]}{P(\#2 \text{ or } \#3)} = \frac{P(\#2)}{P(\#2 \text{ or } \#3)} = \frac{1/3}{1/3 + 1/3} = \frac{1/3}{2/3} = \frac{1}{2} ,$$

since the only way in which both #2 and (#2 or #3) can occur is if #2 occurs.
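The definition-based computation above can be checked by direct enumeration. The following Python sketch assumes, as in the example, three equally likely barriers; the helper names `p` and `conditional` are illustrative only and do not come from the original text.

```python
from fractions import Fraction

# Equally likely sample space: barriers #1, #2, #3 (assumption taken from the example).
prob = {1: Fraction(1, 3), 2: Fraction(1, 3), 3: Fraction(1, 3)}

def p(event):
    """Probability of an event given as a set of barrier numbers."""
    return sum(prob[b] for b in event)

def conditional(b_event, a_event):
    """P(B|A) = P(A and B) / P(A), defined only when P(A) is non-zero."""
    if p(a_event) == 0:
        raise ValueError("P(A) must be non-zero")
    return p(a_event & b_event) / p(a_event)

A = {2, 3}  # barrier #1 has been ruled out
B = {2}     # the submarine transits barrier #2
p_b_given_a = conditional(B, A)
print(p_b_given_a)
```

Exact fractions are used so that the result can be compared directly with the hand computation.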

2. By simply writing the conditional probability down, as sometimes 
the information that A has occurred enables the determination of the 
probability of B. 

For example, in the experiment of drawing two cards from a deck of 
playing cards one at a time without replacing the drawn card in the deck, 
it is known that the first card drawn is a Spade. Given that the first 
card is a Spade, the probability that the second card is also a Spade 
can immediately be written down as 

P(draw #2 is a Spade | draw #1 is a Spade) = 12/51 ,

since the original 13 Spades in the deck have been reduced to 12, and
the original 52 total cards (the sample space) to 51 by the first draw.
The probability thus formed is consistent with the ratio concept of
probability.

3. By utilizing Bayes' law, which will now be presented without
proof and with only the following restriction.

Bayes' law holds for experiments on sample spaces which consist of
definite groups of events that do not "overlap." Such a sample space is
seen to correspond to the delimited areas in Figure 13.

Such a sample space is said to be PARTITIONED.

Bayes' law states that on a partitioned space the probability of an
event A occurring, given that event B has previously occurred, is

$$P(A|B) = \frac{P(B|A) \cdot P(A)}{P(B|A) \cdot P(A) + P(B|A^c) \cdot P(A^c)} ,$$

where P(A^c) is the probability complement (covered earlier) of P(A), or
P(A^c) = 1 - P(A).

For example, suppose it is known from intelligence reports that 
missiles of type C and D are carried by exactly two classes of Soviet 
submarines, say class A and class B. Submarines of class A are known 
to carry 16 missiles of type C and eight of type D, while those of 
class B are known to carry ten of each. A missile is fired from a 
submarine and observed to be of type C. Find the probability that the 
missile was fired from a class A sub, under the following conditions:

1. each missile to be fired is chosen at random from the total
supply of missiles aboard the submarine, and
2. the Soviets normally operate twice as many subs of class B as
of class A in the area.

The problem is to find P(A|C); it can be found using Bayes' law. To
determine the values to be used, it is best to first draw two blocks 
representing the class A and B submarines and the types of missiles they 
carry as shown in Figure 14. (Remember that there are two class B subs 
for each class A.) 



                   Class A submarine    Each of two Class B submarines
C missiles                16                        10
D missiles                 8                        10
Total missiles            24                        20

Figure 14



By Bayes' law then

$$P(A|C) = \frac{P(C|A) \cdot P(A)}{P(C|A) \cdot P(A) + P(C|A^c) \cdot P(A^c)} ;$$

here A^c = B; therefore,

$$P(A|C) = \frac{P(C|A) \cdot P(A)}{P(C|A) \cdot P(A) + P(C|B) \cdot P(B)} = \frac{(16/24) \cdot (1/3)}{(16/24) \cdot (1/3) + (10/20) \cdot (2/3)} = \frac{2/9}{5/9} = \frac{2}{5} .$$
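The missile computation can be reproduced with a few lines of Python. This is only a sketch of Bayes' law for a space partitioned into an event and its complement; the function name `bayes` and its argument names are illustrative assumptions, not notation from the original text.

```python
from fractions import Fraction

def bayes(p_b_given_a, p_a, p_b_given_not_a):
    """P(A|B) on a space partitioned into A and its complement."""
    p_not_a = 1 - p_a
    numerator = p_b_given_a * p_a
    return numerator / (numerator + p_b_given_not_a * p_not_a)

# Event A: missile fired from a class A sub; event C: missile is of type C.
p_c_given_a = Fraction(16, 24)   # 16 of the 24 missiles on a class A sub are type C
p_c_given_b = Fraction(10, 20)   # 10 of the 20 missiles on a class B sub are type C
p_a = Fraction(1, 3)             # twice as many class B subs as class A subs

posterior = bayes(p_c_given_a, p_a, p_c_given_b)
print(posterior)
```

Working in exact fractions keeps the intermediate values (2/9 and 5/9) identical to those in the hand computation.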

As a second example of the application of Bayes' law, consider a
circuit breaker used to protect a certain piece of ASW hardware from
short circuits. At the moment the "on" switch is thrown to activate
the equipment, the circuit breaker essentially "tests" the circuit for a
short. If a short is detected, the circuit breaker should immediately
trip; otherwise it should allow operation to continue.

Suppose it is known that although short circuits do not always trip
the circuit breaker, they do trip it 99.5% of the time. On the other
hand, suppose the probability that the circuit breaker trips when there
is no short is 3%. If the probability that there is a short in the
equipment when the switch is thrown is 0.01, find the conditional
probability that there is no short circuit given that the circuit breaker
does not trip.

The technique of solution involves several steps: 1) describe
mathematically the probability "events" involved; 2) formulate what is
given and immediate consequences of the given; 3) formulate the unknown;
4) write down Bayes' law as it applies to the events of this problem;
5) substitute the given numbers and solve.

1) Let: T be the event that the circuit breaker trips;
        T^c the event that the circuit breaker does not trip;
        S the event that there is a short circuit when the "on"
        switch is thrown;
        S^c the event that there is no short when the "on"
        switch is thrown.

2) We are given that

P(T|S) = 0.995, P(T|S^c) = 0.03, P(S) = 0.01.

From the given it follows immediately that

P(T^c|S) = 0.005, P(T^c|S^c) = 0.97, P(S^c) = 0.99.

3) The problem is to find

P(S^c|T^c) .

4) By Bayes' law,

$$P(S^c|T^c) = \frac{P(T^c|S^c) \cdot P(S^c)}{P(T^c|S^c) \cdot P(S^c) + P(T^c|S) \cdot P(S)} .$$

5) Substitute:

$$P(S^c|T^c) = \frac{(0.97)(0.99)}{(0.97)(0.99) + (0.005)(0.01)} = \frac{0.9603}{0.9603 + 0.00005} = 0.999948 .$$
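The five steps above can be followed line by line in a short Python sketch. The variable names are illustrative assumptions; only the three given probabilities come from the problem statement.

```python
# Step 2: the given probabilities.
p_short = 0.01                  # P(S)
p_trip_given_short = 0.995      # P(T|S)
p_trip_given_no_short = 0.03    # P(T|S^c)

# Immediate consequences of the given (probability complements).
p_no_short = 1 - p_short                                  # P(S^c)
p_no_trip_given_short = 1 - p_trip_given_short            # P(T^c|S)
p_no_trip_given_no_short = 1 - p_trip_given_no_short      # P(T^c|S^c)

# Step 4: the denominator of Bayes' law is the total probability of "no trip".
p_no_trip = (p_no_trip_given_no_short * p_no_short
             + p_no_trip_given_short * p_short)

# Step 5: substitute and solve for P(S^c|T^c).
p_no_short_given_no_trip = p_no_trip_given_no_short * p_no_short / p_no_trip
print(round(p_no_short_given_no_trip, 6))
```

The intermediate value `p_no_trip` corresponds to the denominator 0.9603 + 0.00005 in the hand computation.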



STATISTICAL INDEPENDENCE 

Another important and useful concept is that of STATISTICAL
INDEPENDENCE. Two events are called statistically independent if
knowledge of the occurrence of one does not affect the probability of
occurrence of the other. What this means for events A and B, where
P(A) ≠ 0 and P(B) ≠ 0, is that P(B|A) = P(B) and P(A|B) = P(A). From
conditional probability it is known that

P(B and A) = P(B • A) = P(B|A) • P(A) ,

but since P(B|A) = P(B), it is said that A and B are statistically
independent if

P(A • B) = P(A) • P(B) .

For example, if an air screen barrier and a destroyer barrier are
placed about a convoy in such a manner that the probability of detection
on one of the screens does not affect the probability of detection on
the other screen, then the joint probability of detection of the submarine
in both screens is simply the product of the probability of detection
from each of the screens. Therefore, if the air screen barrier has a
0.50 probability of detection and the destroyer screen barrier has a
0.30 probability of detection, then the probability that the submarine
will be detected on both screens is

P(Detection on both ASB and DSSB) = (0.50) (0.30) = 0.15.

This probability is, of course, contingent upon the submarine attempting
to penetrate the screens. (The fact that this joint probability is
smaller than either of the (marginal) probabilities which comprise it
may, at first, seem intuitively incorrect; that is, it is easy to
misinterpret the statement of what probability is being sought in such
a problem. However, when it is realized that what is sought here is
the probability that both platforms simultaneously hold contact on the
submarine, the smaller resultant (joint) probability then seems more
intuitively consistent.)
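The product rule for the two screens can be illustrated by listing all four joint outcomes. The Python sketch below assumes, as the example does, that the screens are statistically independent; the variable names are illustrative only.

```python
from itertools import product

p_air, p_destroyer = 0.50, 0.30   # marginal detection probabilities from the example

# Enumerate the four joint outcomes (detected or not on each screen).
# Under independence each joint probability is the product of the marginals.
joint = {}
for air, destroyer in product([True, False], repeat=2):
    pa = p_air if air else 1 - p_air
    pd = p_destroyer if destroyer else 1 - p_destroyer
    joint[(air, destroyer)] = pa * pd

p_both = joint[(True, True)]
print(p_both)
```

The four joint probabilities sum to one, and the entry for detection on both screens is smaller than either marginal, as the text notes.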

A major distinction between statistically independent events and 
mutually exclusive events is that the joint probability of the mutually 
exclusive events is zero, since the occurrence of one of a group of 
mutually exclusive events precludes the (simultaneous) occurrence of 
any other event in that group. The joint probability of independent 
events, on the other hand, is clearly not zero unless at least one of 
the independent events has zero probability of occurrence. If A and B
are independent, then

P(A and B) = P(A) • P(B) = 0

if and only if P(A) = 0 or P(B) = 0.

Supplementary solved problems covering elementary probability theory
are presented at the end of Chapter 6 of Schaum's Outline Series, Theory
and Problems of Statistics and at the end of Chapters 3 and 4 of Schaum's
Outline Series, Theory and Problems of Probability. These problems should
be consulted for a further review of the material presented in this
chapter.

CHAPTER V: RANDOM VARIABLES



RANDOM VARIABLES 

A variable which assumes a numerical value determined by the outcome
of a probability experiment is called a random variable. Actually
a random variable is a shorthand notation for listing each possible
outcome of an experiment. For example, if a coin is tossed two times and
the number of heads is counted, then the random variable may assume the
value 0, 1, or 2.

There are two types of random variables, discrete and continuous. A
discrete random variable is a random variable X on a sample space which
has a finite or countably infinite number of outcomes; say, X(S) =
{x_1, x_2, ..., x_n}. We can make X(S) into a probability space (conditions
on this are listed with the probability distribution below) by defining
the probability of x_i to be P(X = x_i), which may also be written f(x_i).
This function f on X(S), defined by f(x_i) = P(X = x_i), is called the
distribution or probability function of X and is usually given in the
form of a table, as shown in Table 5. (It should be noted that in the
discrete case the distribution is the probability function developed
in Chapter IV.)



x_i      |  x_1      x_2      ...   x_n
f(x_i)   |  f(x_1)   f(x_2)   ...   f(x_n)

Table 5: Distribution Function of X

The distribution f satisfies the conditions:

$$f(x_i) \ge 0 \quad \text{and} \quad \sum_{i=1}^{n} f(x_i) = 1 .$$

For example, suppose a pair of fair dice is tossed. We obtain the
finite equiprobable space S consisting of the 36 ordered pairs of
numbers between 1 and 6:

S = {(1,1), (1,2), (1,3), ..., (6,6)}

Let X assign to each point (a,b) in S the maximum of a and b; that is,
X(a,b) = max(a,b), so that X(S) = {1, 2, 3, 4, 5, 6}. We compute the
distribution f of X:

f(1) = P(X = 1) = P({(1,1)}) = 1/36

f(2) = P(X = 2) = P({(2,1), (2,2), (1,2)}) = 3/36

f(3) = P(X = 3) = P({(3,1), (3,2), (3,3), (2,3), (1,3)}) = 5/36

and similarly f(4) = P(X = 4) = 7/36, f(5) = P(X = 5) = 9/36 and
f(6) = P(X = 6) = 11/36. The information is put into the form of a
table as shown in Table 6.



x_i    =   1      2      3      4      5      6
f(x_i) =   1/36   3/36   5/36   7/36   9/36   11/36

Table 6: Distribution f of X



It can also be seen that f(x_i) ≥ 0 and

$$\sum_{i=1}^{6} f(x_i) = 1/36 + 3/36 + 5/36 + 7/36 + 9/36 + 11/36 = 36/36 = 1 .$$
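The distribution of the maximum of two dice can be generated by enumerating the 36 equally likely ordered pairs. The Python sketch below is one way to do so; the dictionary name `f` simply mirrors the notation f(x_i) used in the text.

```python
from fractions import Fraction
from itertools import product

# Build f(x) = P(X = x), where X is the maximum of the two dice.
f = {}
for a, b in product(range(1, 7), repeat=2):
    x = max(a, b)
    f[x] = f.get(x, 0) + Fraction(1, 36)   # each ordered pair has probability 1/36

for x in sorted(f):
    print(x, f[x])   # Fraction prints in lowest terms, e.g. 3/36 appears as 1/12

total = sum(f.values())
```

The two conditions on a distribution, non-negative values that sum to one, can then be checked directly on `f`.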

A continuous random variable is one in which the set of numbers
X(S) is an interval or a set of intervals. If the interval (a ≤ X ≤ b)
is an event in S, then its probability is P(a ≤ X ≤ b), as
shown in Figure 15.

Figure 15: P(a ≤ X ≤ b) = Area of Shaded Region

Here P(a ≤ X ≤ b) corresponds to the area under the graph of f between
x = a and x = b. In the language of calculus,

$$P(a \le X \le b) = \int_a^b f(x)\,dx ,$$

which in effect can be approximated by adding up areas under the curve
between a and b. For example, take the area shown in Figure 15 and
partition it into smaller areas as shown in Figure 16a and b, and then
sum the areas of the resulting rectangles.





Figure 16a

Figure 16b

An approximation of the area of each subregion of Figures 16a and 16b
can now be made by taking the area of the rectangle which has the same
base as the subregion and height equal to the minimum height for that
subregion. By adding up the areas of the approximating rectangles, an
approximation is now obtained for the area of the total region between
a and b. The area that has not been included is the shaded area, which
makes the results only an approximation. It can be seen that, due
to the increased number of partitions, each with a smaller base, in
Figure 16b, the shaded area is smaller, which results in a better
approximation. Actually the greater the number of partitions, the
smaller the shaded area, and the better the approximation. If f(x) is
known, the exact area between a and b may easily be determined by using
calculus. However, it should be noted that the use of calculus is not
required for most practical cases of interest in this course, as values
are obtained from standard tables. Standard tables exist for most
continuous random variable distributions used in ASW applications;
therefore, it is not intended that the student have a working knowledge of
calculus. The following example is given, therefore, simply to show an
appreciation of calculus:

Let X be a continuous random variable with the following distribution:

Figure 17: Graph of f

Then

$$P(1 \le X \le 1.5) = \int_a^b f(x)\,dx = \int_1^{1.5} f(x)\,dx = .5625 - .25 = .3125 ,$$

where it is seen that the calculus operation of integration has been
utilized to "sum" the area of the region under the curve between 1 and
1.5 as shown in Figure 17.
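The rectangle approximation described above can be tried numerically. The graph of f in Figure 17 is not reproduced here, so the Python sketch below ASSUMES the density f(x) = x/2 on the interval 0 to 2; this choice is an assumption made only because it is consistent with the values .5625 - .25 = .3125 in the example. The function names are illustrative.

```python
def lower_sum(f, a, b, n):
    """Sum of n rectangles of equal base, each with the minimum height on its base.

    For an increasing f the minimum height on each sub-interval is at its left end.
    """
    width = (b - a) / n
    return sum(f(a + i * width) * width for i in range(n))

def density(x):
    # Assumed density (not taken from the original figure): f(x) = x/2 on [0, 2].
    return x / 2 if 0 <= x <= 2 else 0.0

coarse = lower_sum(density, 1.0, 1.5, 4)      # few partitions, larger omitted area
fine = lower_sum(density, 1.0, 1.5, 1000)     # many partitions, smaller omitted area
print(coarse, fine)
```

Under this assumption the coarse sum falls short of .3125 by more than the fine sum does, which is the behavior described for Figures 16a and 16b.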

DENSITY FUNCTION 

The function that describes the probability of each interval on the
x-axis (for example, the bimodal function, f(x), in Figure 15) is called
a continuous probability function or simply the (probability) density
function of X. The density function must satisfy the constraints that
f(x) ≥ 0 for all x, and that the total area under the density function
must equal 1. Special notice should be given to the fact that the
probability of a point-value of the random variable X is strictly equal
to zero, even though the value of f at the point x may be non-zero. An
intuitively appealing explanation of why the probability of a single
outcome of a continuously distributed random variable is zero is given
in the following example.

Consider an attack submarine (SSN) searching for enemy submarines.
The probability that an initial detection bearing will be exactly some
prescribed angle is zero. The key word here is exactly. For example,
the word "exactly" would mean that a prescribed bearing of 30° is
exactly that, right down to every decimal place. If the word "exactly" is
not used, then "30°" may represent a whole event, which would have a
positive probability. For example, if we are rounding off to the nearest
degree and the density function is constant over the interval from 0° to
360°, then "30°" would mean the event from 29.5000...° to 30.499...°,
which has probability 1/360. If we are rounding off to the nearest minute,
then "30°" refers to the interval from 29°59.500...' to 30°0.499...',
which has probability 1/21,600. If we round off to the nearest second,
then "30°" is the event from 29°59'59.500..." to 30°0'0.499...", which
has probability 1/1,296,000. "Exactly 30°" is a single point, not an
interval, and its probability must be no larger than the probability of
every interval that contains it, no matter how small. Thus the
probability of a bearing angle of "exactly 30°" is zero.
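The shrinking interval probabilities in the bearing example can be tabulated with a short Python sketch. It assumes, as the example does, a constant density over 0° to 360°; the function name `interval_probability` is illustrative.

```python
from fractions import Fraction

def interval_probability(width_degrees):
    """Probability of a bearing interval of the given width under a constant density on 0 to 360 degrees."""
    return Fraction(width_degrees) / 360

p_degree = interval_probability(1)                   # rounding to the nearest degree
p_minute = interval_probability(Fraction(1, 60))     # rounding to the nearest minute
p_second = interval_probability(Fraction(1, 3600))   # rounding to the nearest second
print(p_degree, p_minute, p_second)
```

Each refinement of the rounding divides the probability by 60, so the probability of any interval containing the exact bearing can be made as small as desired.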

CUMULATIVE DISTRIBUTION FUNCTION 

Let X be a random variable, x a real number, and F(x) the probability
that X takes on values less than or equal to x:

F(x) = P(X ≤ x)

Then the function F is called the cumulative distribution function
(CDF) of X. CDF values are of particular importance in ASW in their
application to search regions. For example (in the continuous random
variable case), it is important to know the probability of contact
on a target along the search length, L, up to and including the
point x = L/2, where exactly half (or 1/3 or 3/4 or some other portion)
of the area has been searched. This type of information is essential to
allow reasonable allocation of available search effort. Further, the CDF
of the target evaluated at the x value corresponding to that point
half-way (in this example) through the search region represents this
desired (cumulative) probability, as seen in Figure 18, where, for purposes
of generality, x is taken to be the middle or center x value of the search
region's axis.

For example, using the data shown in Table 6, the cumulative
distribution function F of X would be as shown in Figure 19, which is
obtained by summing the frequencies, f(x_i), corresponding to those values
x_i less than or equal to x.




Figure 19: Graph of F

Observe that F in Figure 19 consists of a sequence of horizontal line
segments and two rays. One ray coincides with that part of the x-axis
to the left of x = 1 because P(X ≤ x) is 0 if x < 1. There is a jump
in the graph at x = 1 because F(x) = 0 to the left of 1 and F(1) = 1/36.
The graph has constant height F(x) = 1/36 for 1 ≤ x < 2. At x = 2
there is a jump of 3/36 (or f(2) = 3/36) bringing F(x) up to F(2) = 4/36.
As we proceed along the graph from left to right, we find a jump at each
integer x = 1, 2, 3, 4, 5, 6. For x ≥ 6, the graph has another ray
extending indefinitely to the right because F(x) = P(X ≤ x) is 1 if x is
any real number greater than or equal to 6. If we desire to find the CDF
for a number between the integer values, for example F(5/2), we can see
from Figure 19 that F(5/2) = 4/36. To compute the CDF of a continuous
random variable, it is again necessary to use calculus; however, its
values can usually be obtained from a standard table.
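The step-function behavior of the discrete CDF can be reproduced from the probabilities f(1) = 1/36 through f(6) = 11/36 computed earlier for the maximum of two dice. The Python sketch below is a minimal illustration; the function name `cdf` is an assumption for the example.

```python
from fractions import Fraction

# Distribution of the maximum of two fair dice.
f = {1: Fraction(1, 36), 2: Fraction(3, 36), 3: Fraction(5, 36),
     4: Fraction(7, 36), 5: Fraction(9, 36), 6: Fraction(11, 36)}

def cdf(x):
    """F(x) = P(X <= x): sum f over all values not exceeding x."""
    return sum((p for value, p in f.items() if value <= x), Fraction(0))

print(cdf(0.5), cdf(1), cdf(2.5), cdf(6))
```

Evaluating `cdf` between integers returns the height of the horizontal segment, which is why F(5/2) equals F(2).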

MEAN, VARIANCE AND STANDARD DEVIATION 

The mean, variance and standard deviation are also used to describe
a random variable. These will be shown here for discrete random variables
only, as the values for continuous random variables will be found from
tables.

The mean or expectation of X, denoted by E(X) (or simply E or μ),
is defined, for discrete random variables, by

$$E(X) = x_1 f(x_1) + x_2 f(x_2) + \cdots + x_n f(x_n) = \sum_{i=1}^{n} x_i f(x_i) .$$

That is, E(X) is the weighted average of the possible values of X,
where each value is weighted by its probability. For example, to compute
the mean of X for the data of Table 6,

$$E(X) = \sum_{i=1}^{6} x_i f(x_i) = 1(1/36) + 2(3/36) + 3(5/36) + 4(7/36) + 5(9/36) + 6(11/36) = 161/36 = 4.47 .$$

The mean of a random variable X measures, in a certain sense, the "average"
value of X.

The variance of X measures the "spread" or "dispersion" of X. Let X
be a random variable with the distribution shown in Table 5. Then the
variance of X, denoted by Var(X), is defined by

$$\mathrm{Var}(X) = \sum_{i=1}^{n} (x_i - \mu)^2 f(x_i) = E((X - \mu)^2) ,$$

where μ is the mean of X. For example, using the data from Table 6,

$$\mathrm{Var}(X) = \sum_{i=1}^{6} (x_i - \mu)^2 f(x_i) = (1 - 4.47)^2 (1/36) + (2 - 4.47)^2 (3/36) + (3 - 4.47)^2 (5/36) + (4 - 4.47)^2 (7/36) + (5 - 4.47)^2 (9/36) + (6 - 4.47)^2 (11/36)$$

$$= .334 + .508 + .300 + .043 + .070 + .715 = 1.97 .$$

The standard deviation of X, denoted by σ_X, is the
(nonnegative) square root of Var(X): σ_X = √Var(X). For example,
using the data from Table 6, the standard deviation is

σ_X = √Var(X) = √1.97 = 1.4 .
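The mean, variance and standard deviation of the dice-maximum distribution (f(1) = 1/36 through f(6) = 11/36) can be computed exactly with a short Python sketch. The variable names are illustrative; exact fractions are used so that rounding in the hand computation does not obscure the result.

```python
from fractions import Fraction
from math import sqrt

# Distribution of the maximum of two fair dice.
f = {1: Fraction(1, 36), 2: Fraction(3, 36), 3: Fraction(5, 36),
     4: Fraction(7, 36), 5: Fraction(9, 36), 6: Fraction(11, 36)}

mean = sum(x * p for x, p in f.items())                    # E(X)
variance = sum((x - mean) ** 2 * p for x, p in f.items())  # Var(X) = E((X - mean)^2)
std_dev = sqrt(variance)                                   # nonnegative square root

print(float(mean), float(variance), std_dev)
```

Because the hand computation rounds the mean to 4.47 before squaring, its individual terms differ slightly from the exact ones, but the totals agree to the precision quoted.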

CHAPTER VI: BASIC STATISTICAL DISTRIBUTIONS



BACKGROUND 

Generally, systems which arise in solving many practical ASW problems 
must be describable in terms of a simple mathematical model , and in most 
interesting cases this mathematical description is in probabilistic 
(rather than deterministic) terms. Before a problem can be analyzed for 
probabilistic solutions the basic unknown quantities must be represented 
as random variables (which constitute the mathematical model). Once 
this is done, the density function or the CDF is determined, and the 
system is thereby completely described in probabilistic terms. The only 
task remaining then is to compute the probabilities which constitute a 
problem solution, and this in itself is theoretically straightforward.
However, it should be noted that some problems can be modeled more
easily than others; yet an important advantage of this method is the
possibility of describing many ASW problems in terms of a few "well
known" random variables. The distributions which are developed here are
the Bernoulli, Binomial, Chi-Square, Uniform, Normal, Bivariate Normal, 
Time-Varying Normal, Exponential and Poisson. 

BERNOULLI DISTRIBUTION 

A random variable X with exactly two possible values, say 1 and 0,
where the probabilities P[X = 1] = p and P[X = 0] = 1 - p = q, is
called a Bernoulli random variable. Such a random variable is important
because it may be used to describe any random experiment with two possible
outcomes, such as target detection from a single ping of an active sonar.
Indeed, it is the Bernoulli distribution that is used in making a decision
requiring a "Yes" or "No" answer. However, for our purposes Bernoulli
random variables are used primarily as the basis for the Binomial
distribution, which is discussed next.

BINOMIAL DISTRIBUTION 

The Binomial distribution uses the two alternatives of the Bernoulli
distribution, P[X = 1] = p and P[X = 0] = q, and an n-fold repetition
of the basic experiment. The problem now is to find the probability of
exactly i successes in n repeated independent trials; for example, the
probability of exactly two detections in six pings of an active sonar, or
the probability of one detection in four sweeps of a radar. The Binomial
random variable with parameters n and p is the discrete random variable
given by

$$P[X = i] = \binom{n}{i} p^i q^{n-i}, \quad i = 0, 1, 2, \ldots, n,$$

and P[X = i] = 0 otherwise. (Recall that the binomial coefficient
$\binom{n}{i}$ was defined in Chapter III as n!/(i!(n - i)!).)

In order to use the Binomial distribution to find the probability of
exactly 2 (or i) detections in 6 (or n) pings of an active sonar, the
probability of a single ping success must be known. For example, if
the probability of success is 0.6, or p = 0.6, then q = 1 - p = 0.4 and
therefore

$$P[X = 2] = \binom{n}{i} p^i q^{n-i} = \binom{6}{2} (0.6)^2 (0.4)^{6-2} = \frac{6!}{2!\,(6-2)!} (0.6)^2 (0.4)^4 = \frac{6 \cdot 5 \cdot 4 \cdot 3 \cdot 2 \cdot 1}{2 \cdot 1 \cdot 4 \cdot 3 \cdot 2 \cdot 1} (0.36)(0.0256) = (15)(0.36)(0.0256) = 0.138 .$$

The mean of a Binomial random variable with parameters n and p is μ = np,
the variance is σ² = npq, and the standard deviation is σ = √(npq). In
the above example, μ = np = (6)(0.6) = 3.6, σ² = npq = (6)(0.6)(0.4) =
1.44 and σ = √1.44 = 1.2. With a probability of 0.6 and 6 pings, we
would expect 3 to 4 detections from every 6 pings on the average; for
this problem σ indicates, with roughly 68% confidence (a rule of thumb
borrowed from the Normal distribution, and only approximate for so few
trials), that the number of detections from 6 pings is between 2.4 and
4.8 on the average.
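The Binomial formula and its mean, variance and standard deviation can be evaluated with Python's standard library. The sketch below reproduces the six-ping example; the function name `binomial_pmf` is an illustrative assumption.

```python
from math import comb, sqrt

def binomial_pmf(i, n, p):
    """P[X = i] for a Binomial random variable with parameters n and p."""
    if i < 0 or i > n:
        return 0.0
    return comb(n, i) * p**i * (1 - p)**(n - i)

n, p = 6, 0.6                      # six pings, single-ping detection probability 0.6
p_two = binomial_pmf(2, n, p)      # exactly two detections

mean = n * p
variance = n * p * (1 - p)
std_dev = sqrt(variance)
print(round(p_two, 3), mean, variance, std_dev)
```

Summing `binomial_pmf(i, n, p)` over i = 0, ..., n gives one, which is a convenient check that the formula has been entered correctly.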

The following problems show how the binomial distribution can be used:

1. A P-3C aircraft has been loaded with eight active sonobuoys
which are known to have a 20% failure rate. These buoys are to be
utilized in a situation requiring that failed buoys be replaced
immediately to maintain pattern integrity. As this requires fast,
coordinated action by the aircrew to reposition the aircraft for "delivery"
and to prepare replacement buoys, it is important for them to estimate
the number of buoys requiring replacement. Determine the probability
that, out of four sonobuoys selected for a pattern,

(a) 1 sonobuoy will fail

(b) 0 sonobuoys will fail

(c) 2 sonobuoys will fail

(d) at most 2 sonobuoys will fail.

(Note that part (d) can be simplified through use of parts (a), (b), (c),
and the probability complement concept.)



(a) $$P[X = 1] = \binom{4}{1} (0.2)^1 (0.8)^3 = \frac{4!}{1!\,(4-1)!} (0.2)(0.8)^3 = (4)(0.2)(0.512) = 0.4096$$

(b) 0.4096

(c) 0.1536

(d) $$P[X \le 2] = 1 - P[X > 2] = 1 - \left[ \binom{4}{3} (0.2)^3 (0.8)^1 + \binom{4}{4} (0.2)^4 (0.8)^0 \right] = 0.9728$$



2. If the probability of damaging a submarine with a single depth
charge from a destroyer is 0.5, what is the probability that 6 out of 10
depth charges will damage the submarine?

Answer: 0.21



3. Compute the (a) mean, (b) variance, and (c) standard deviation of 
a binomial distribution in which p = 0.7 and n = 60. 

Answer: (a) 42 

(b) 12.6 

(c) 3.55 



4. Consider a box of 200 fuses. It is known that for each fuse the
probability of being defective is 0.02. What is the probability that
at most six fuses are defective? (Set up the problem but do not carry
out the calculations, although the solution is not too difficult if one
has the use of a pocket calculator. It will be seen later that a good
approximation to this answer can be obtained much more easily by means of
the Poisson approximation.)

Answer:

$$P[X \le 6] = \sum_{i=0}^{6} \frac{n!}{i!\,(n-i)!}\, p^i q^{n-i} = \sum_{i=0}^{6} \frac{200!}{i!\,(200-i)!}\, (0.02)^i (0.98)^{200-i} = 0.8914$$
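The answers to the Binomial problems above can be checked numerically. The Python sketch below sums the Binomial probabilities directly; the helper names `binomial_pmf` and `binomial_cdf` are illustrative assumptions and not part of the original text.

```python
from math import comb

def binomial_pmf(i, n, p):
    """P[X = i] for a Binomial random variable with parameters n and p."""
    return comb(n, i) * p**i * (1 - p)**(n - i)

def binomial_cdf(k, n, p):
    """P[X <= k], obtained by summing P[X = i] for i = 0, ..., k."""
    return sum(binomial_pmf(i, n, p) for i in range(k + 1))

p1a = binomial_pmf(1, 4, 0.2)       # Problem 1(a): exactly one of four buoys fails
p1d = binomial_cdf(2, 4, 0.2)       # Problem 1(d): at most two of four buoys fail
p2 = binomial_pmf(6, 10, 0.5)       # Problem 2: six of ten depth charges cause damage
p4 = binomial_cdf(6, 200, 0.02)     # Problem 4: at most six defective fuses in 200

print(round(p1a, 4), round(p1d, 4), round(p2, 2), round(p4, 4))
```

For Problem 4 the direct sum is tedious by hand but immediate by machine, which is the point made in the problem statement about the use of a calculator.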



UNIFORM PROBABILITY DISTRIBUTION 

The Uniform probability distribution represents situations in which 
any two sub-regions of the same size have equal probabilities, as explained 
below. This distribution is shown in Figure 20, which shows that the 
random variable X may assume only values between X = a and X = b, or
a ≤ X ≤ b, with non-zero probability (the probability of any region
outside the interval a ≤ X ≤ b is zero).

Figure 20: Uniform Probability Distribution and Its CDF



Since the area under a probability distribution must equal one, the height
of the density function between a and b must be 1/(b - a). The mean value
of a Uniform distribution coincides with the central x-value, and is
(b + a)/2. The variance is (b - a)²/12. The figure also shows the
cumulative distribution function for the Uniform distribution.

A Uniform distribution is described by writing "Uniform (a,b)" or
"U(a,b) distribution"; for example, a Uniform distribution having its
non-zero probability values in the interval 0 to 1 is denoted as "a
Uniform zero, one distribution."

An example application of the Uniform random variable is the 
probability distribution of the location of a submarine in a narrow or 
restricted body of water. If it is known that the submarine will not be 
in water less than 100 feet deep, then the probability that the 
submarine will be in such regions is zero. If there is no information 
to indicate any preference in location, it must be assumed that the 
probability distribution for the location of the submarine is uniform,
as shown in Figure 21.

Figure 21: Profile of a Narrow Channel and Associated
Uniform Probability Distribution



Suppose that a = 1,500 yards and b = 20,000 yards from a reference
position (at one of the restriction boundaries, for example). The
probability that the submarine is located in any subinterval, say of
width 1,000 yards, is

$$P[c \le X \le d] = (d - c) \cdot \frac{1}{b - a} = (1{,}000 \text{ yards}) \cdot \frac{1}{18{,}500 \text{ yards}} \approx 0.05 .$$



Special note should be made of the fact that the dimensions (yards, in 
this example) associated with the physical problem must cancel so that 
the resultant probability is a dimensionless number. The probabilities 
of any two intervals of the same length within the interval from a to b 
are equal; that is, 

P[c ≤ X ≤ d] = P[e ≤ X ≤ f] if d - c = f - e,
where both c ≤ X ≤ d and e ≤ X ≤ f are within a ≤ X ≤ b.
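The channel example can be expressed as a small Python function. This is a sketch under the stated assumption of a Uniform distribution between a = 1,500 and b = 20,000 yards; the function name and the particular subintervals chosen are hypothetical.

```python
def uniform_interval_probability(c, d, a, b):
    """P[c <= X <= d] for X Uniform on (a, b), with the subinterval inside (a, b)."""
    if not (a <= c <= d <= b):
        raise ValueError("subinterval must lie within the interval from a to b")
    return (d - c) / (b - a)   # the units (yards) cancel, leaving a pure number

a, b = 1500, 20000   # yards from the reference position

# Two hypothetical 1,000-yard subintervals at different places in the channel.
p_near = uniform_interval_probability(2000, 3000, a, b)
p_far = uniform_interval_probability(10000, 11000, a, b)
print(round(p_near, 3), round(p_far, 3))
```

Both subintervals have the same length and therefore the same probability, regardless of where in the channel they lie.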

Suppose that two sensors at separate locations each produce a line of
bearing (LOB) to a submarine target. Each of the LOB's has a bearing
error of ten degrees (±5°) associated with it. Suppose further that the
LOB's intersect in a plane region "centered" 117.5 n.m. from the first
sensor and 66.5 n.m. from the second. The information available to a
prosecuting platform is that the location of the submarine is Uniformly
distributed over the plane region. Since a bearing error of 1 degree
represents a range error of about 1 n.m. (perpendicular to the LOB) at
a range of 60 n.m., the area is a trapezoid as shown by the
"cross-hatched" region in Figure 22.




Figure 22: Trapezoid Produced by Two Sensors

Suppose sensors #1 and #2 are separated by 135 n.m. and the LOB's are
60 degrees from sensor #1 and 330 degrees from sensor #2; then the
trapezoid approximates a rectangle with sides 20 by 12 n.m.; therefore,
the probability region has an area of about 240 n.m.². If a sonobuoy
with a 3 mile radius of detection is dropped in the probability region,

(a) what probability of detection does this sonobuoy yield?

(b) what probability of detection would six nonoverlapping
sonobuoys yield?

Answer: (a) $$\text{Probability of Detection} = \frac{\text{area of sonobuoy}}{\text{area of trapezoid}} \approx \frac{\pi r^2}{240} = \frac{28.27}{240} = 0.12$$

(b) Probability of Detection = 0.71
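The area-ratio reasoning behind both answers can be written out in a few lines of Python. The sketch assumes, as the problem does, a Uniform distribution over a region of about 240 square n.m. and sonobuoy coverage circles that lie entirely within the region and do not overlap; the variable names are illustrative.

```python
from math import pi

region_area = 240.0      # approximate area of the probability region, square n.m.
detection_radius = 3.0   # sonobuoy radius of detection

buoy_area = pi * detection_radius**2
p_one = buoy_area / region_area   # (a) a single sonobuoy
p_six = 6 * p_one                 # (b) six nonoverlapping sonobuoys: areas simply add
print(round(buoy_area, 2), round(p_one, 2), round(p_six, 2))
```

The simple multiplication in part (b) is valid only because the coverage areas are stated to be nonoverlapping.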

NORMAL DISTRIBUTIONS 

THE NORMAL DISTRIBUTION 

The Normal distribution is probably the most important continuous 
random variable used in ASW. It is used to describe the sum of a large 
number of statistically independent random variables. The Normal 
probability density function is defined as: 

$$f(x) = \frac{1}{\sigma\sqrt{2\pi}}\, e^{-(x - \mu)^2 / (2\sigma^2)} \quad \text{for all real numbers } x,$$

where μ and σ² are the mean and variance. We denote the Normal
distribution with mean μ and variance σ² by N(μ, σ²). Figure 23
shows the Normal distribution N(0,1).

Figure 23: Normal Distribution N(0,1)

The probability that a Normally distributed random variable X is less
than or equal to b (which is the Cumulative Distribution Function) is
given by:

$$F_X(b) = P[X \le b] = \int_{-\infty}^{b} \frac{1}{\sigma\sqrt{2\pi}}\, e^{-(x - \mu)^2 / (2\sigma^2)}\,dx .$$

Evaluation of this integral can be circumvented by use of standard
tables. The probability that X is less than or equal to b can be
represented by the shaded area in Figure 24, and its magnitude is
determined with the aid of tables in Appendix A.




Figure 24: The Density Function of the Normal Distribution

Note that the Normal density function is dependent only on the two
parameters μ and σ². If μ and σ² are both specified, probabilities
can be obtained from the table in Appendix A. If

$$f_X(b) = \frac{1}{\sqrt{2\pi}}\, e^{-b^2/2} ,$$

then the random variable X has a Normal distribution with mean 0 and
variance 1, and X is called a Standard Normal random variable. Most
tables are written for the Standard Normal distribution; therefore, to
obtain probabilities for any other mean or variance values the appropriate
conversions must be made. To compute the probability that X lies between
a and b, convert a and b into standard units:

a' = (a - μ)/σ and b' = (b - μ)/σ

respectively. Then

P(a ≤ X ≤ b) = P(a' ≤ X* ≤ b') = area under the Standard Normal
curve between a' and b', where X* is the Standard Normal random variable.

The following problems are used to familiarize the student with the
use of the Standard Normal distribution N(0,1) and Appendix A:

1. Let X be a random variable with the Standard Normal distribution
N(0,1). Find:

(i) $P(0 \le X \le 1.42)$

(ii) $P(-0.73 \le X \le 0)$

(iii) $P(-1.37 \le X \le 2.01)$

(iv) $P(0.65 \le X \le 1.26)$

(v) $P(-1.79 \le X \le -0.54)$

(vi) $P(X \ge 1.13)$



(i) $P(0 \le X \le 1.42)$ is equal to the area under the Standard Normal curve
between 0 and 1.42. Thus in Appendix A, page A-1, look down the second
column until 1.42 is reached. The entry is .4222. Hence
$P(0 \le X \le 1.42) = .4222$.

(ii) By symmetry, $P(-0.73 \le X \le 0) = P(0 \le X \le 0.73) = .2673$

(iii) $P(-1.37 \le X \le 2.01) = P(-1.37 \le X \le 0) + P(0 \le X \le 2.01) = .4147 + .4778 = .8925$

(iv) $P(0.65 \le X \le 1.26) = P(0 \le X \le 1.26) - P(0 \le X \le 0.65) = .3962 - .2422 = .1540$

(v) $P(-1.79 \le X \le -0.54) = P(0.54 \le X \le 1.79) = P(0 \le X \le 1.79) - P(0 \le X \le 0.54) = .4633 - .2054 = .2579$

(vi) $P(X \ge 1.13) = P(X \ge 0) - P(0 \le X \le 1.13) = .5000 - .3708 = .1292$
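
As a cross-check on the table lookups in problem 1, the Standard Normal cumulative distribution function can be evaluated from the error function in Python's standard library. This is a minimal sketch rather than a replacement for Appendix A; the names `phi` and `between` are introduced here for illustration.

```python
import math

def phi(z):
    """Standard Normal cumulative distribution function, via the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def between(a, b):
    """P(a <= X <= b) for a Standard Normal random variable X."""
    return phi(b) - phi(a)

answers = {
    "i": between(0.0, 1.42),
    "ii": between(-0.73, 0.0),
    "iii": between(-1.37, 2.01),
    "iv": between(0.65, 1.26),
    "v": between(-1.79, -0.54),
    "vi": 1.0 - phi(1.13),
}
for label, value in answers.items():
    print(label, round(value, 4))
```

The computed values should agree with the tabulated answers to about three decimal places; small differences in the fourth place come from rounding in the table entries.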





2. Let X be Normally distributed with mean 8 and standard deviation
4. Find:

(i) $P(5 \le X \le 10)$        (iii) $P(X \ge 15)$

(ii) $P(10 \le X \le 15)$      (iv) $P(X \le 5)$

(i) To find $P(5 \le X \le 10)$, first convert to the Standard Normal
distribution N(0,1):

$P(a \le X \le b) = P(a' \le X^* \le b')$

$a' = (a - \mu)/\sigma = (5 - 8)/4 = -.75$

$b' = (b - \mu)/\sigma = (10 - 8)/4 = .5$

Therefore $P(-.75 \le X^* \le .5) = P(-.75 \le X^* \le 0) + P(0 \le X^* \le 0.5) = .2734 + .1915 = .4649$

(ii) $P(10 \le X \le 15) = P(.5 \le X^* \le 1.75) = P(0 \le X^* \le 1.75) - P(0 \le X^* \le .5) = .4599 - .1915 = .2684$

(iii) $P(X \ge 15) = P(X^* \ge 1.75) = P(X^* \ge 0) - P(0 \le X^* \le 1.75) = .5000 - .4599 = .0401$

(iv) $P(X \le 5) = P(X^* \le -.75) = P(X^* \le 0) - P(-.75 \le X^* \le 0) = .5000 - .2734 = .2266$

3. Suppose the temperature T during June is Normally distributed
with mean 68° and standard deviation 6°. Find the probability p that
the temperature is between 70° and 80°.

70° in standard units = (70 - 68)/6 = .33

80° in standard units = (80 - 68)/6 = 2.00

Then $p = P(70 \le T \le 80) = P(0.33 \le T^* \le 2) = P(0 \le T^* \le 2) - P(0 \le T^* \le 0.33) = .4772 - .1293 = .3479$

Here $T^*$ is the standardized random variable corresponding to T, so
$T^*$ has the Standard Normal distribution N(0,1).
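
The conversion to standard units used in problems 2 and 3 can be sketched in Python as follows. The function names are assumptions made for this illustration. The code does not round the standardized value to two decimals as a table lookup does, so the temperature result differs slightly from the tabulated .3479.

```python
import math

def normal_cdf(x, mean, std_dev):
    """P(X <= x) for X ~ N(mean, std_dev**2), by converting x to standard units."""
    z = (x - mean) / std_dev
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def normal_between(a, b, mean, std_dev):
    """P(a <= X <= b) for X ~ N(mean, std_dev**2)."""
    return normal_cdf(b, mean, std_dev) - normal_cdf(a, mean, std_dev)

# Problem 2: mean 8, standard deviation 4
p2_i = normal_between(5, 10, 8, 4)
p2_iii = 1.0 - normal_cdf(15, 8, 4)

# Problem 3: mean 68 degrees, standard deviation 6 degrees
p3 = normal_between(70, 80, 68, 6)

print(round(p2_i, 4), round(p2_iii, 4), round(p3, 4))
```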

4. Suppose that the lifetime of a certain type of electronic tube
is a Normally distributed random variable. Suppose further that brand
A has a lifetime X which is N(30 hours, 36 hours$^2$), while brand B has a
lifetime Y which is N(45 hours, 9 hours$^2$). Of these two brands, which
would you choose for use in an experimental aircraft with:

(a) 34 hours mission time

(b) 40 hours mission time

(Hint: you should be able to solve these questions by simply sketching
the two Normal distributions).
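
For readers who want to check their sketch numerically, the following Python fragment compares the two brands. It assumes that "choose" means picking the brand more likely to outlast the mission, which is one reasonable reading of the problem; the function name `survival` is introduced here for illustration.

```python
import math

def survival(t, mean, variance):
    """P(lifetime > t) for a Normal lifetime with the given mean and variance."""
    z = (t - mean) / math.sqrt(variance)
    return 1.0 - 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

for mission_hours in (34, 40):
    p_a = survival(mission_hours, 30, 36)   # brand A: N(30, 36)
    p_b = survival(mission_hours, 45, 9)    # brand B: N(45, 9)
    better = "A" if p_a > p_b else "B"
    print(mission_hours, round(p_a, 3), round(p_b, 3), better)
```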

BIVARIATE NORMAL DISTRIBUTION 

The Bivariate Normal and the Time-Varying Normal distributions are
proposed inclusions to the computer program of the P-3 aircraft for
tactical, on-station use of probability theory. Although a thorough
development of the distributions is considered beyond the scope of this
course, the following cursory description of each of these distributions
should be sufficient for an intuitive idea of how these distributions
are used.

The Bivariate Normal distribution is the joint probability distribution
of two Gaussian (or Normal) random variables. Basically, this
distribution is the three-dimensional extension of the two-dimensional
(the value and its frequency) Normal distribution. As seen in Figure 25,
the Bivariate Normal distribution resembles a hat, and the specific
Bivariate Normal distribution shown has a central value, which may or
may not be located at the origin.




Figure 25: Bivariate Normal Distribution

The Bivariate Normal distribution is useful in Search Theory, and 
for two-dimensional area coverage. 

If the student is interested in further development of the Bivariate
Normal, reference should be made to Selected Methods and Models in
Military Operations Research, P. W. Zehna, et al., U.S. G.P.O., 1971,
pages 49-61.

TIME-VARYING NORMAL DISTRIBUTION

The Time-Varying Normal distribution describes a Normal random variable
whose distribution varies as a function of time as well as of another
parameter such as range. Figure 26 shows an example where the mean is
zero for all values of time t, but the variance increases with increasing t.

Figure 26: Time-Varying Normal Distribution

The distribution shown in Figure 26 can be used to describe a submarine
opening a reported datum, where a "datum outbound" situation is assumed
to exist. As the figure illustrates, the probability that the submarine
is in close proximity to the reported datum (as reported at time $t_0$, or 0)
decreases with increasing time. This is indicated by the spreading out
and flattening of the distribution at successive times.
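
The flattening effect can be illustrated with a short Python sketch. The text does not specify how the variance grows with time, so the linear growth model in `sigma_at`, its parameter values, and the 5-mile proximity distance are all hypothetical choices made only for this illustration. The sketch treats position error along a single axis.

```python
import math

def prob_within(distance, sigma):
    """P(|X| <= distance) for X ~ N(0, sigma**2)."""
    return math.erf(distance / (sigma * math.sqrt(2.0)))

def sigma_at(t_hours, sigma0=2.0, growth_per_hour=1.5):
    """Hypothetical spread model: standard deviation grows linearly with time."""
    return sigma0 + growth_per_hour * t_hours

# Probability of being within 5 miles of datum at successive times (hours).
probs = [prob_within(5.0, sigma_at(t)) for t in (0, 1, 2, 4)]
print([round(p, 3) for p in probs])
```

Whatever growth law is assumed, an increasing standard deviation with a fixed mean of zero makes the probability of remaining near the datum fall as time passes.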

It is important to understand the development thus far presented to 
provide correct insight into what this distribution represents. Further, 
it is a definite advantage--and, in fact, a necessity for tactical, real- 
time usage--to have a digital computer to perform the mathematics associated 
with the Time-Varying Normal distribution resulting from an operator- 
input assessment of the tactical situation. The computer could then 
output the pertinent probabilities and options for operator use. Again, 
if the operator is to utilize the computer to employ the distribution,
then it is imperative that he understand the tactical assumptions it is
based on.

CHI-SQUARE DISTRIBUTION



The Chi-Square random variable is useful for ASW applications in that
it can be used to describe any system that is the sum of the squares of
independent Standard Normal random variables. Probably the most important of
these applications is that of describing radial distance, where the
distance is in terms of two or three directions, each of which has
distance distributed Normally. The radial distance to a
specific point (from the origin) is the square root of the sum of the
squares of the cartesian coordinates of the point (by the Pythagorean
theorem).

An example which displays the utility of the Chi-Square distribution
is to obtain the probability that a submarine is within 5 miles of a
central datum (the origin, for convenience), where it is known that the
probabilities of the submarine being at a distance in miles along each of
the x and y axes are each distributed N(0,100) due to error in the system
that initially located the submarine. The submarine's position can be
in error (or vary from the stated datum position) in both the x and y
directions; therefore, its position may be specified by two random
variables, X and Y. Although there are two random variables involved,
the effect of using the Chi-Square distribution is a reduction of the
problem to a single random variable R, which represents radial distance
from the submarine to the central datum:

$$R = \sqrt{X^2 + Y^2}.$$

The problem is to find the probability of the shaded region shown in
Figure 27.

Figure 27: Sum of Squares Less Than or Equal to 25 Miles

That is, to calculate $P[(X^2 + Y^2)^{1/2} \le 5]$, or equivalently,
$P[X^2 + Y^2 \le 25]$, we apply the following principle from probability
theory (refer to Central Limit Theorem on page 142 of Schaum's Outline
Series, Theory and Problems of Statistics): The sum of the squares of n
independent Standard Normal random variables forms a new random variable
known as the Chi-Square random variable with "n degrees of freedom"; this is
denoted by

$$\sum_{i=1}^{n} X_i^2 = \chi_n^2.$$

In the specific two-dimensional case treated here, if we put
$X_1 = \frac{1}{10} X$ and $X_2 = \frac{1}{10} Y$, then

$$\sum_{i=1}^{2} X_i^2 = \chi_2^2.$$

The probability is now denoted by

$$P[X^2 + Y^2 \le 25] = P[10^2 X_1^2 + 10^2 X_2^2 \le 25] = P\left[X_1^2 + X_2^2 \le \frac{25}{10^2}\right] = P[X_1^2 + X_2^2 \le 0.25],$$

or

$$P[\chi_2^2 \le 0.25] = F_{\chi_2^2}(0.25).$$

The last term is the probability that the Chi-Square random variable
with two degrees of freedom is less than or equal to 0.25. This
cumulative value is found from the "Percentage Points of the Chi-Square
Distribution" in Appendix A. This value, $F_{\chi_2^2}(0.25)$, is found by entering
the table with n (the left-hand column) value of 2 (the number of
degrees of freedom; $\nu$ is used for n in the table), and reading directly
to the right until the column is reached that has the tabulated value
closest to 0.25. That column is then followed vertically upward until
the top row is reached; the row is labeled as Q. The value in this row
and column is the complement of the desired probability, and it is

$$Q_{\chi_2^2}(0.25) = 0.90,$$

$$F_{\chi_2^2}(0.25) = 1 - Q_{\chi_2^2}(0.25) = 1 - 0.90; \text{ therefore}$$

$$F_{\chi_2^2}(0.25) = 0.10.$$
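
For two degrees of freedom the Chi-Square cumulative distribution function has the closed form $1 - e^{-x/2}$, so the table lookup can be compared against a direct calculation. The sketch below is only a check on the method; the function name `chi2_cdf_2dof` is introduced here. Because the table is entered at the nearest tabulated column, the direct value is somewhat larger than the 0.10 read from the table.

```python
import math

def chi2_cdf_2dof(x):
    """CDF of the Chi-Square distribution with two degrees of freedom."""
    return 1.0 - math.exp(-x / 2.0) if x > 0 else 0.0

sigma = 10.0    # standard deviation of each coordinate error, in miles
radius = 5.0    # radius of interest about the datum, in miles

# Normalizing by sigma turns X**2 + Y**2 <= radius**2 into chi-square <= (radius/sigma)**2.
p_within = chi2_cdf_2dof((radius / sigma) ** 2)
print(round(p_within, 4))
```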

Now, consider the problem of finding the probability that a
submarine will be "killed" by a moored mine if it is known that the kill
capability of the mine is a function of the radial distance between the
mine and the submarine. This is a three dimensional problem, as the
submarine depth is also considered. The kill function is known to be
certain if the submarine is within 30 yards of the mine, and the
probabilities that the submarine will pass within a given range in each
of the three coordinate directions (depth, "North-South," and "East-West")
are independently distributed N(0,625) in yards. The problem is to
calculate $P[(X^2 + Y^2 + Z^2) \le 900]$.

Let

$$X_1 = \frac{X}{\sigma_X} = \frac{X}{25}, \quad X_2 = \frac{Y}{\sigma_Y} = \frac{Y}{25}, \quad X_3 = \frac{Z}{\sigma_Z} = \frac{Z}{25}.$$

(This technique is that used to normalize and standardize a Normal random
variable.) Then

$$\sum_{i=1}^{3} X_i^2 = \chi_3^2,$$

so that

$$P[X^2 + Y^2 + Z^2 \le 30^2] = P[25^2 (X_1^2 + X_2^2 + X_3^2) \le 900] = P\left[X_1^2 + X_2^2 + X_3^2 \le \frac{900}{25^2}\right] = P[X_1^2 + X_2^2 + X_3^2 \le 1.44] = F_{\chi_3^2}(1.44).$$

From the table on page A-3,

$$Q_{\chi_3^2}(1.44) = 0.75, \quad \text{and} \quad F_{\chi_3^2}(1.44) = 0.25.$$

Hence, under the assumed probability distributions, the probability that
the submarine will be "killed" is 0.25.

There are other methods of solving this problem which involve working 
with the random variables X and Y; however, the Chi-Square method is 
easier, and it is convenient because the probabilities associated with 
it can be simply obtained from a precomputed table. 
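
As a check on the table method for the mine problem, the Chi-Square cumulative distribution function with three degrees of freedom can be written in closed form using the error function. The Python sketch below uses that closed form; the function name `chi2_cdf_3dof` is introduced here for illustration. The table is read at the nearest tabulated column, so the direct value comes out somewhat higher than the tabulated 0.25.

```python
import math

def chi2_cdf_3dof(x):
    """CDF of the Chi-Square distribution with three degrees of freedom."""
    if x <= 0:
        return 0.0
    return (math.erf(math.sqrt(x / 2.0))
            - math.sqrt(2.0 * x / math.pi) * math.exp(-x / 2.0))

sigma_yards = 25.0         # standard deviation in each coordinate direction
kill_radius_yards = 30.0   # radius within which a kill is certain

p_kill = chi2_cdf_3dof((kill_radius_yards / sigma_yards) ** 2)
print(round(p_kill, 3))
```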

The following problems will help familiarize the student with the 
use of the Chi-Square distribution and Appendix A: 



1. Find $P[\chi_{20}^2 \le 21.5] = F_{\chi_{20}^2}(21.5)$.

Answer: 0.75




Find P j 






Answer : 


0.25 





3. Find the value of x for which $F_{\chi_{25}^2}(x) = 0.75$.

Answer: 29.3

4. An antiaircraft shell designed to burst at a specified point in
space has x, y, and z errors in a coordinate system with origin at the
specified point which are independent Normally distributed random
variables X, Y, Z, each with mean 0 and standard deviation 50, with
length measured in feet. Find the probability that the radial error in
the burst point is at most 100 feet.

Answer: 0.75

EXPONENTIAL DISTRIBUTION 

An exponential function is a function of the form $f(t) = e^t$, where
e is a constant which is very much like $\pi$, except that e is
approximately 2.7183, vice 3.1416. For example, if t = 2, then
$f(2) = e^2 = (2.7183)^2 = 7.3891$, as seen from the last entry opposite
x = 2.00 in the table of Appendix A, page A-2. The specific exponential
function of interest here is the negative exponential or $f(t) = e^{-t}$ (see
Appendix A, page A-2 for values of $e^{-x}$) which is the reciprocal of the
positive exponential shown before, or $f(t) = 1/e^t$.

The Exponential distribution itself is actually

$$f_X(t) = \lambda e^{-\lambda t} \quad (t \ge 0),$$

where $\lambda$, the lower case Greek letter "lambda," is a positive
constant. It turns out that $1/\lambda$ is the mean of this distribution. The
graph of the Exponential distribution is shown in Figure 28. It is seen
in the figure that the function (the Exponential distribution) "decays"
with the progression of time.

The Exponential distribution can be used to describe many phenomena
connected with military applications. An example where such a function
occurs is the MTTF (Mean Time To Failure) of a piece of electronic equipment.
The time axis represents the total lifetime of the equipment itself.
The random variable X represents the total lifetime; that is, to say that
X assumes a value t means that the equipment fails at time t.

The cumulative distribution function is

$$F_X(t) = 1 - e^{-\lambda t}.$$

Therefore, the table in Appendix A, page A-2, is still usable, except
that the value found in the table must be subtracted from one. Figure 29
shows the Cumulative Exponential probability distribution.



Figure 29: The Cumulative Exponential Distribution Function

Practical ASW applications of the Exponential probability distribution
are predicting: (1) the mean time to trail a submarine and (2) the mean
time to fail for ASW equipment.

The following problems show some of these applications. 

1. Suppose that it is desired to trail a submarine that transits a
barrier. Take the time that the submarine enters the barrier to be
t = 0, and let $P_0$ be the probability that trail is initiated at t = 0,
where $P_0$ is known. In this problem $\lambda$ is the reciprocal of the mean
time to lose trail (or the average rate of losing trail). For this
problem let $1/\lambda$ be 20 days, and let $P_0 = 0.5$. What is the probability
the submarine is trailed at time t = 10 days? Since $1/\lambda = 20$,
$\lambda = 1/20 = 0.05$; therefore, using the cumulative distribution
function $F_X(t) = 1 - e^{-\lambda t}$,

$$P(t) = P_0 (1 - e^{-\lambda t}) = 0.5 (1 - e^{-0.5}) = 0.5 (1 - 0.6) = (0.5)(0.4) = 0.2$$

2. A device is to operate until failure. The pdf in a model describing
the time of failure is $f(t) = e^{-t}$, $0 < t$, where t is in hours. What
is the probability the device will fail in the first two hours?

Answer: In order that $e^{-t}$ be of the form $\lambda e^{-\lambda t}$, $\lambda$ must equal
one; that is, the cdf is $F(t) = 1 - e^{-t}$. Thus P[Fail in first 2 hrs] =
$1 - e^{-2} = 1 - 0.135 = 0.865$

3. A resistor is installed in a fire control system. The pdf
describing the time to failure of the resistor is $f(t) = \lambda e^{-\lambda t}$, $0 < t$,
where t is in years. If the MTTF is 15 years what is the probability the
resistor will last 6 years?

$$\lambda = \frac{1}{\text{MTTF}} = \frac{1}{15} \approx 0.07$$

P[Resistor lasts 6 years] = 1 - P[Resistor fails in 6 years] =

$$1 - [1 - e^{-\lambda (6)}] = e^{-6/15} = e^{-0.4} = 0.67$$
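
The failure and survival probabilities in problems 2 and 3 follow directly from the cumulative distribution function $F(t) = 1 - e^{-\lambda t}$. The Python sketch below evaluates both; the function names `exp_cdf` and `exp_survival` are introduced here for illustration, and the exact rate 1/15 is used for the resistor rather than a rounded value.

```python
import math

def exp_cdf(t, rate):
    """P(X <= t) for an Exponential lifetime with the given failure rate."""
    return 1.0 - math.exp(-rate * t) if t > 0 else 0.0

def exp_survival(t, rate):
    """P(X > t): probability the item is still working at time t."""
    return 1.0 - exp_cdf(t, rate)

p_fail_2h = exp_cdf(2.0, rate=1.0)               # problem 2: rate of 1 per hour
p_last_6y = exp_survival(6.0, rate=1.0 / 15.0)   # problem 3: MTTF of 15 years
print(round(p_fail_2h, 3), round(p_last_6y, 3))
```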

4. The reliability of a device during a mission depends on T, the
time the device has been in storage before the start of the operation.
In a probability model, A is the event the device survives the mission,
and $f(t) = e^{-t}$ is the pdf of T. Find P(A) for a 6 month mission, where T is
in years.

$F(t) = 1 - e^{-t}$, P(A) = 0.6

POISSON DISTRIBUTION



Many physical phenomena can be described by the Poisson distribution. 
Emission of particles from a radiating source, arrivals of messages in a 
CIC, and submarines transiting a barrier are a few of the shipboard ASW 
situations that may require this distribution for analysis of the 
problem. 

The basic notion of this distribution involves a stream of "customers"
arriving at a "service point" at a constant average arrival rate. It is
assumed also that arrivals are independent in the sense that the number
of arrivals in any given time interval is independent of the number of
arrivals in any other interval which does not overlap the first.

The Poisson distribution is given by

$$p(k, \lambda) = \frac{\lambda^k e^{-\lambda}}{k!}, \quad k = 0, 1, 2, \ldots$$

where $\lambda > 0$ is some constant.

In the applications of the Poisson distribution that are of primary
interest in ASW, the Poisson distribution represents the number of
"arrivals" in the given time interval. For example, if time is
measured in hours, the given time interval is a certain ten-hour period,
and $\lambda = 0.5$, then the notation

$$p(2, 0.5) = \frac{(0.5)^2 e^{-0.5}}{2!} = 0.076$$

is interpreted to mean:

"the probability of exactly two arrivals in the given time period is
0.076."

The length of the time interval is not apparent in the expression
defining $p(k, \lambda)$. Suppose the basic unit of time (this may be a second,
a minute, an hour, a day, a year, etc.) has been chosen. Let T be the
length of the given time interval, i.e. the number of basic time units.
Then the $\lambda$ in the Poisson distribution has the property that $\lambda/T$ is
the mean arrival rate, expressed as the number of arrivals per unit of
time. That is, if we denote the mean arrival rate by $\alpha$, then

$$\alpha = \lambda/T, \quad \text{or} \quad \lambda = \alpha T.$$

Therefore, for applications other than the Binomial approximation
indicated next, we use

$$p(k, \alpha T) = \frac{e^{-\alpha T} (\alpha T)^k}{k!}, \quad k = 0, 1, 2, \ldots$$

to express the probabilities for the various possible number of arrivals.

This distribution has a mean, $\mu = \lambda$, variance, $\sigma^2 = \lambda$, and
standard deviation, $\sigma = \sqrt{\lambda}$.

In addition to having its own applications, the Poisson distribution
also provides us with a close approximation of the Binomial distribution
for small k, provided that p is small and $\lambda = np$. This is indicated
in Table 7 where n = 100, p = 1/100 and $\lambda = np = 1$.



k           0       1       2       3        4        5

Binomial    0.366   0.370   0.185   0.0610   0.0149   0.0029

Poisson     0.368   0.368   0.184   0.0613   0.0153   0.00307

Table 7: Comparison of Binomial and Poisson Values
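
The entries of Table 7 can be regenerated with a few lines of Python. This is a minimal sketch using only the standard library (`math.comb` requires Python 3.8 or later); the function names are introduced here for illustration.

```python
import math

def binomial_pmf(k, n, p):
    """P(exactly k successes in n independent trials with success probability p)."""
    return math.comb(n, k) * p ** k * (1.0 - p) ** (n - k)

def poisson_pmf(k, lam):
    """Poisson probability p(k, lam) = lam**k * exp(-lam) / k!."""
    return lam ** k * math.exp(-lam) / math.factorial(k)

n, p = 100, 0.01
lam = n * p   # Poisson parameter matched to the Binomial mean

for k in range(6):
    print(k, round(binomial_pmf(k, n, p), 4), round(poisson_pmf(k, lam), 4))
```

The two columns stay close for every k shown, which is the sense in which the Poisson distribution approximates the Binomial when p is small.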



The Poisson distribution can probably best be understood through
examples and problems.

1. Find: (i) $e^{-1.3}$ (ii) $e^{-2.5}$

From Appendix A, page A-2,

(i) $e^{-1.3} = 0.273$

(ii) $e^{-2.5} = 0.0821$

2. For the Poisson distribution $p(k, \lambda) = \lambda^k e^{-\lambda}/k!$ find

(i) p(2,1), (ii) p(3,1/2), (iii) p(2,.7)

(i) $p(2,1) = \frac{1^2 e^{-1}}{2!} = \frac{e^{-1}}{2} = \frac{0.368}{2} = 0.184$

(ii) p(3,1/2) = 0.013

(iii) p(2,.7) = 0.12

3. Suppose that in a particular region submarine detection times
obey a Poisson distribution, with a mean detection rate of one submarine
every 12 days. Calculate the probability that five submarines will be
detected in the operating area during a quarter (120 days).

As indicated above, $\lambda = \alpha T$, where $\alpha$ represents the average
detection rate (i.e. the average number of detections per day, which is
1/12), and T the length of the given time interval, so that T = 120.
Therefore,

$$P[K = 5] = \frac{e^{-\alpha T} (\alpha T)^k}{k!} = \frac{e^{-(1/12)(120)} [(1/12)(120)]^5}{5!} = 0.03783$$

4. Using the information from problem 3, find the probability of
detecting five or fewer (at most five) submarines during the quarter.

$$P[K \le 5] = \sum_{k=0}^{5} \frac{[(1/12)(120)]^k e^{-(1/12)(120)}}{k!} \approx 0.067$$
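
The single-term and cumulative Poisson calculations of problems 3 and 4 can be sketched in Python as follows. The function names and the `rate`/`interval` parameterization are choices made for this illustration; they mirror the relation between the Poisson parameter, the mean arrival rate, and the interval length T given earlier.

```python
import math

def poisson_pmf(k, rate, interval):
    """P(exactly k arrivals) for arrivals at `rate` per unit time over `interval`."""
    lam = rate * interval
    return lam ** k * math.exp(-lam) / math.factorial(k)

def poisson_cdf(k_max, rate, interval):
    """P(at most k_max arrivals) in the interval."""
    return sum(poisson_pmf(k, rate, interval) for k in range(k_max + 1))

rate_per_day = 1.0 / 12.0   # one detection every 12 days on average
days = 120.0                # length of the quarter

p_exactly_5 = poisson_pmf(5, rate_per_day, days)
p_at_most_5 = poisson_cdf(5, rate_per_day, days)
print(round(p_exactly_5, 5), round(p_at_most_5, 4))
```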

5. Approximate the Binomial distribution problem number 4 in the
Binomial distribution section of this chapter, using the Poisson
distribution. Answer: 0.9